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COMPOSITIONS AND METHODS FOR DETERMINING THE 
SUSCEFllBILITY OF A PATHOGENIC VIRUS TO PROTEASE INHIBITORS 



1. FIELD OF INVENTION 

This invention relates to conq)Ositions and methods for determining the susceptibility 
of a pathogenic virus to an anti-viral compound. The compositions and methods are usefiil 
5 for identifying effective drug regimens for the treatment of viral infections, and identi^ring 
and detennining the biological effectiveness of potential therapeutic conq>ounds. 

2. BACKGROUND OF THE INVRNTION 

More than 60 million people have been infected with the human immunodeficiency 
virus CBIVO, the causative agait of acquired immune deficiency syndrome ("AIDS*0, since 
10 the early 1980s. SeeLucas, 2002, Lepr Rev. 73(1 ):64.71. HIV/AIDS is now die leading 
cause of death in sub-Saharan Afiica, and is the fourth biggest killer worldwide. At the end 
of 2001, an estimated 40 million people were living with HIV globally. See Nonis, 2002, 
Radiol Technol. 73(4):339-363. 

Modem anti-HIV dmgs target different stages of the HIV life cycle and a variety of 
15 enzymes essential for HIV's replication and/or survival. Amongst the dmgs that have so far 
been approved for AIDS therapy are nucleoside reverse transcriptase inhibitors such as AZT, 
ddl, ddC, d4T, 3TC, abacavir, imcleotide reverse transcriptase inhibitors such as tenofovir, 
non-nucleoside reverse transcriptase inhibitors such as nevinq>]ne, efavirenz, delavirdine and 
protease inhibitors such as saquinavir, ritonavir, indiiuivir, nelfinavir, amprenavir and 
20 lopinavir. 

One consequence of the action of an anti-viral drug is that it can exert sufBcient 
selective pressure on virus replication to select for drug-resistant mutants (Henmaim et al^ 
1977. Ann NY Acad Sci 284:632-637). With increasing drug exposure, the selective pressure 
on the replicating virus population increases to promote the more rapid emergence of drug 
25 resistant mutants. 

With the inevitable emergence of drug resistance, strategies must be designed to 
optimize treatment in the &ce of resistant virus populations. Ascertaining the contribution of 
drug resistance to dmg failure is difiBcult because patients that are likely to develop dmg 
resistance are also likely to have other factors that predispose them to a poor prognosis 
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(Ridunan. 1994, AmS Res Hum Retroviruses 10:901-905). In addition, each patient 
typically haibors a diveise mixture of mutant strains of the virus with different mutant strains 
having different susceptibilities to anti-viral drugs. 

The traditional tools available to assess anti-viral drug resistance are inadequate; the 
5 classical tests fbr deteiminmg ihe resistance of HIV to an anti-viial agent are complex, 

time-consuming, expensive, potentially hazardous and not custom tailored to the treatment of 
a given patient See Bane^inoussi et al., 1983. Science 220:868-871; Popovic et aL, 1984. 
Science 224:497-500), and variations of it (see. e.g., Goedert et aL, 1987, JAMA 
257:331-334; Allain et al., 1987. N. Engl. J. Med. 317:1 1 14-1121; Piatak et al., 1993, Science 
10 259:1749-1754; Urdea. 1993, Clin. Chem. 39:725-726; KeUam and Larder. 1994, 
Antimicrobial Agents and Chemo. 3823-30. 

Two gaieial ^roaches are now used for measuring resistance to anti-viral drugs. 
The first, called phenotypic testing, direcfly measures the susceptibility of virus taken finom 
an infected person's virus to particular anti-viral dmgs. Petropoulos et al., 2000, Antimicrob. 
15 Agents Chemother. 44:920-928 and Hertogs et al., 1998, Antimicrob Agents Chemother 
42(2):269-76 provide a description of phenotypic assays in widespread use today. Gunthard 
et al, 1998, AIDS Res Hum Retroviruses 14:869-76 and Schuurman et al., 1999, J Clin 
Microbiol. 37:2291 -96 discuss currently prevalent genotypic assays. Hirsch et al. , 2000, 
JAMA 283:2417-26 provide a general analysis of the currentiy available assays for testing 
20 druig suscqitibility. 

The second method, caUed genotypic testing, detects mutations in the virus that affect 
drug susceptibiUty and can associate specific genetic mutations with drug resistance and drug 
feihms. Genotypic testing examines virus taken ftom a patient, looking for (he presence of 
specific genetic mutations that are associated with resistonce to certoin drugs. Genotypic 
25 testing has a few advantages over phenotypic testing, most notably tiie relative simpUcity and 
speed wifli which tiie test can be performed. The testing can take as Uttte as a few days to 
complete, and because it is less complex, it is somewhat cheaper to perform. However, 
interpretation of genotypic data is dependent on previous knowledge of tiie relationships 
between ^ecific mutations and dianges in drag susc^tibility. 
30 Efforts to date to use genotypic conrelates of reduced susceptibility to predict the 

effectiveness of anti-viral drugs, especially drugs targeted against the ever-evolving HIV are. 
at best, imperfect An algoritiun that can more accurately predict whether a given anti-viral 
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drug or combination of drugs would be effective in treating a given patient would save time 
and money by identifying dnigs that are not likely to succeed before they are administered to 
the patient More importantly, it would improve the quality of life of the patient by sparing 
him or her the trauma o f treatment with potent toxins that result in no improvement with 
5 respect to his or her HIV infection. Therefore, an urgent need exists for a more accurate 
algorithm for predicting whether a particular drug would be effective for treating a particular 
patient. Moreover, a genotype based assay can be faster and more cost effective dian 
phenotypic assays. 

3. SUMMARY OF THE INVENTION 

1 0 The present invention provides methods and compositions for developing and using 

algorithms for determining the effectiveness of an anti-viral ther^y or combination of 
fherqiies. The al^rithms are based on an analysis of paired phenotypic and genotypic data 
guided by phraotypic clinical cut-of & (the point at which resistance to a therapy begins and 
sensitivity ends). The algorithms significantly inq>rove the quality of life of a patient by 

1 S accurately predicting v^ether a given anti-viral drug would be effective in treating tbe 

patient, thereby sparing him or her the trauma of treatment with potent toxins ibat result in 
no improvement in his or her HIV infectioa 

In one aq[iect, the present invention provides algorithms that allow one to provide a 
patient with an effective treatment regunen by predicting whether an infected individual will 
20 respond to treatment with an anti-viral agent or combination of agents, thereby allowing an 
effective treatment regunen to be designed without subjecting the patient to unnecessary side 
effects. Also, by avoiding the administration of ineffective drugs, considerable time and 
money is saved. 

In another aspect, the present invention provides methods for determining the 
25 suscq>tibility of a virus to an anti-viral treatment, comprising detecting, in the viral genome 
or viral enzymes, the presence or absence of mutations associated with reduced susceptibility 
to the anti-viral treatmmt. 

In another aspect, the present invention provides methods for determining the 
effectiveness of an anti-viral treatment of an individual infected with a virus, conq)rising 
30 detecting, in a sample from said individual, the presence or absence of mutations associated 
widi reduced susceptibility to the anti-viral treatment. 
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The present invention also provides metbods of monitoring the clinical progression of 
viral infection in individuals receiving an anti-viral treatment by detennining, as described 
above, the effectiveness of the same or a different anti-viral treatment 

In one embodiment, the present invention provides nucleic adds and polyp^tides 
5 comprising a mutation in the protease of a human immunodeficiency virus nn>n 

associated with reduced 8usc«jptibiUty to a protease inhibitor. Examples of protease inhibitors 
include, but are not limited to, saquinavir, ritonavir, indinavir, nelfinavir. amprenavir and 
lopinavir. 

In one aspect, the invention provides a method for dctenmning ¥*eflier a human 
10 immunodeficiency virus (HIV) has an increased likelihood of having reduced suscq^ 

to treatment with a protease inhibitor, comprising: detecting whether the protease encoded by 
said HIV exhibits the presence or absence of one or more HIV protease mutations listed in 
Table 1; and applying a set of rules to said mutations as provided in Table 4; wherein said 
HIV has an increased likelihood of being resistant to treatment with said protease inhibitor if 
IS said set ofrules is satisfied. 

In another aspect, the invention provides a method for determining whether an 
individual infected with a human immunodeficiency virus (HIV) has an increased likelihood 
of having reduced susceptibiUty to treatment with a protease inhibitor, con^sing: detecting, 
in a sample fiom said individual, the presence or absence of one or more HIV protease 
20 mutations listed in Table 1; and aqiplying a set ofrules to said mutations as provided in 
Table 4; wherein said individual has an increased likelihood of being resistant to treatment 
wifli said protease inhibitor if said set of rules is satisfied. 

In another aspect, the invention provides a method for determining whether a HIV has 
an increased likelihood of having a reduced susceptibility to treatment with a protease 
25 inhibitor, comprising: detecting whether die protease encoded by said HIV exhibits the 
presence or absence of a mutation associated with reduced susceptibility to treatment with 
said protease inhftitor at amino acid position 1 1, 32, 33, 34. 43, 46. 47. 48, 50. 54. 58, 71, 76, 
79, 82, 83, 84. 91 or 95 of an amnio acid sequence of said protease, wherein die presence of 
said mutation indicates that the HIV has an increased likelihood of having reduced 
30 susceptibiUty to treatment with the protease inhibitor, with the proviso that said mutation is 
not V32I, M46I. M46L, I47V, I50V, I54L. I54M or I84V. 
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In another aspect, the invention provides a method of detennining whether an 
individual infected with HIV has an increased likelihood of having a reduced susceptibility to 
treatment with a protease inhibitor, con^rising: detecting, in a sample from said individual, 
(he presence or absence of a mutation associated with reduced susceptibility to treatment with 
5 said protease inhibitor at amino acid position 1 1, 32, 33, 34, 43, 46, 47, 48, S0> 54, 58, 71, 76, 
79, 82, 83, 84, 91 or 95 of the amino add sequence of the protease of the HIV, wherein the 
presence of said mutation indicates that the individual has an increased likelihood of having 
reduced suscq>tibility to treatment with the protease inhibiUxr, with the proviso that said 
mutation is not V32I, M46I, M46L, I47V, I50V, I54L, I54M or I84V. 

10 In anottier aspect, the invration provides a method of determining whether a HIV has 

an increased likelihood of having a reduced susceptibility to treatment with a protease 
inhibitor, comprising: detecting whether the protease encoded by said HTV exhibits the 
presence or absence of a mutation selected fiom the group consisting of: VI II, V I IL, L33F, 
E34Q, K43T, G48M, I54A, I54S, IS4T, Q58E, A71L, L76V, P79, V82A, V82F. N83D, 

15 I84A, I84C, T91A, T91S, T91 V and C95F, wherein the mutation is associated with reduced 
susceptibility to treatment widi said protease inhibitor and the presence of said mutation 
indicates that the HIV has an increased likelihood of having reduced suscq)tibility to 
treatment with the protease inhibitor 

In a preferred embodiment, the protease inhibitor is amprenavir. 

20 In another preferred embodiment, the human immunodeficiency virus is hpnan 

unmunodeficiency virus type 1 (•'HlV-r')- 

In another aspect, the invoition provides an oligonucleotide between about 10 and 
about 40 nucleotides long encoding a portion of a HIV protease that con^trises a mutation at 
amino acid position 1 1, 32, 33, 34, 43, 46, 47, 48, SO, 54, 58, 71, 76, 79, 82, 83, 84, 91 or 95 
25 of an ammo add sequence of said protease in said human iizununodefidency virus, wherein 
the mutation is associated with reduced suscq>tibility to a protease inhibitor, with ihis proviso 
that said mutation is not V32I, M46I, M46L, MTV, I50V, I54L, IS4M or I84V. 

In anoth^ embodiment, the invention provides an isolated polypeptide that comprises 
at least ten contiguous residues of the amino acid sequence of SEQ ID NO: 1 , wherdn the 
30 polypeptide comprises at least one mutation of the invention listed above, and wherein the 
mutation is associated with reduced susceptibility to a protease inhibitor. In a particular 
embodiment, the protease inhibitor is amprenavir. 
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In another embodiment, the polypeptide comprising said mutation or mutations is at 
least 70%, but less than 100%. identical to a polypeptide having the amino add sequence of 
SEQ ID N0:1; the polypeptide has an amino add sequence that is greater than 80% identical 
to the amino add sequence of SEQ ID N0:1; or the polypeptide has an amino arid sequence 

5 that is greater than 90% identical to the amino acid sequence of SEQ ID N0:1; wherein the 
mutation is assodated with reduced suscqrtibility to a protease inhibitor. 

In one embodiment, the invention provides a method wherein the presence or absence 
of a mutation in aprotease is detected by hybridization with a sequence-specific 
oUgonucleotide probe to a nucldc add sequence of human immunodefidcncy virus encoding 

10 said mutation, wherein the occurrence of hybridization indicates said presence or absence of 
said mutation. 

In another embodiment, the invmtion provides a method wherein ttie presence or 
absence of a mutation in a protease is detected by detennining a nucleic add sequence 
encoding said mutation. 

15 In another embodiment, the invention provides a method wherein tfie presence or 

absence of a mutation in a protease is detected by ampUfying tiie nucleic acid by, for 
example, polymerase chain reaction. 

In one embodiment, the individual is undergoing or has undergone prior treatment 
with an anti-viral drug. In another embodfanent, the anti-viral drug is said or difBsrent 

20 protease inhibitor. 

bone embodiment, the amino acid at position 11 of said protease is an amino add 
havmg a neutral, hydrophobic or non-polar side chain, hi another embodiment, tiie amino 
add at position 1 1 of said protease is I or L. In another embodiment, the amino add at 
position 33 of said protease is an amino add with a neutral, hydrophobic or non-polar side 

25 chain. In another embodiment, the amino add at position 33 of said protease is F. In another 
embodiment, the amino acid at position 34 of said protease is an amino add having a neutral, 
polar or hydtophilic side chain. In another embodiment, the amino acid at position 34 of said 
protease is Q. In another embodunent, the amino acid at position 43 of said protease is an 
amino add with a neutral, hydrophiUc or polar side cham. In another embodiment, the ammo 

30 acid at position 43 of said protease is T. hi another embodiment, the amino acid at position 
48 of said protease is an anuno acid with a neutral, hydrophobic or non-polar side chain, fa 
another embodiment, the amino add at position 48 of said protease is M. In another 
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embodiment, the amino add at position 54 of said protease is an amino acid with a neutral, 
hydrophobic, non-polar, hydrophilic or polar side chain. In another embodiment, the amino 
acid at position 54 of said protease is an amino acid with a neutral, hydrq>hobic or non-polar 
side chain. Iri another embodiment, the amino acid at position 54 ofsaid protease is A. Jn 
S another embodiment, the amino acid at position 54 of said protease is an amino add with a 
neutral, hydrophilic or polar side chain. In another embodiment, the amino add at position 
54 of said protease is S or T. In another embodiment, the amino acid at position 58 ofsaid 
protease is an amino acid with an acidic, hydrophilic or polar side chain. In another 
embodiment, the amino acid at position 58 of said protease is E. In another embodiment, the 

10 amino add at position 71 of said protease is an amino acid with a neutral, hydrophobic or 
non-polar side chain. In another embodiment, the amino acid at position 71 of said protease 
is L. In anoth^ embodiment, the amino add at position 76 of said protease is an amino acid 
with a neutral, hydrophobic or non-polar side chain. In anodier embodiment, the amino acid 
at position 76 of said protease is V. In another embodiment, the amino acid at position 79 of 

1 5 said protease is an amino acid with a neutral, hydrophobic, non-polar, acidic, hydrophilic or 
polar side chain. In another embodiment, ttie amino acid at position 79 of said protease is an 
amino add with a neutral, hydrophobic or non-polar side chain. In another embodiment, the 
amino acid at position 79 of said protease is an amino acid with an acidic, hydrophilic or 
polar side chain. In another mibodiment, the amino acid at position 79 of said protease is 

20 anyaminoacid, withtheprovisothatitisnot aP. In another embodiment, the amino add at 
position 82 of said protease is an amino acid with a neutral, hydrophobic or polar!^de chain. 
In another embodiment, the amino acid at position 82 ofsaid protease is A or F. In another 
embodiment, the amino acid at position 83 of said protease is an amino add with an acidic, 
hydrophilic or polar side chain. In another embodiment, the amino acid at position 83 ofsaid 

25 protease is D. In another embodiment, the amino add at position 84 of said protease is an 
amino acid with a neutral, hydrophobic, non-polar, hydrophilic or polar side chain. In 
another embodiment, the amino acid at position 84 of said protease is an amino add with a 
neutral, hydrophobic or non-polar side chaiiL In another embodiment, the amino acid at 
position 84 of said protease is A. In another embodiment, the amino add at position 84 of 

30 sdd protease is an amino acid with a neutral, hydrophilic or polar side chain. Ihanother 
embodiment, the amino acid at position 84 of said protease is C. In another embodiment, the 
amino acid at position 91 of said protease is an amino acid with a neutral, hydrophobic, non- 
polar, hydrophilic or polar side chain. In another embodiment, the amino add at position 91 
of said protease is an amino acid with a neutral, hydrophobic or non-polar side chain. In 
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another embodiment, the amino acid at position 91 of said pnrtease is A or V. fa another 
embodiment, the amino add at position 91 of said protease is an amino add with a neutral, 
hydrophiUc or polar side chain, fa another embodiment, the amino add at position 91 of said 
protease is S. fa another embodiment, the amino acid at position 95 of said protease is an 

5 amino acid with a neutral, hydrophobic or non-polar side chain, fa another embodiment, the 
amino acid at position 95 of said protease is F. 

fa another aspect, the invention provides a method to detecting the presence or 
absence of a mutation assodated with reduced susceptibiUty to treatment with said protease 
inhibitor at at least 2, 3. 4. 5. 6, 7. 8, 9. 10. 11, 12. 13. 14. 15, 16, 17. 18 or 19 of the amino 

10 acid positions. 

4. llnnCF DESCRIPTION OF T HE WGURES 

FIG. 1 is a diagrammatic representation of the genomic structure of HIV-1 . 
no. 2 is a flow-chart depicting the steps followed to arrive at a final set of samples 
that was analyzed m detail. 
15 FIG. 3A shows the amino add sequence of the NL4-3 HIV (GcnBank Accession No. 

P12497) protease (SBQ. ID. NO: 1). 

FIG. 3B shows the nucldc add sequence fiar the NL4-3 HIV (GenBank Accession 
No. AF324493) protease ^e (SBQ. ID. NO: 2). 

FIG. 4 shows the tree generated by CART analysis of 2499 samples. 
20 nG. 5 is a matrix of pairs of mutations associated witii amprenavir resistance. 

FIG. 6 shows the tree generated by the next round of CART analysis of 2499 samples. 
FIG. 7 shows the re-scnsitization to amprenavir resistance of an I50V containing virus 
by N88S. Phenotypic susceptibiUty of clones containing I50V; I50V and N88S; or I50V and 
L90M to protease mhibitors with the mean fold-change (error bars represent one standard 
25 deviation) for each group of clones is shown;. Drug names are abbreviated as follows: APV, 
amprenavir, IDV. indinavir. LPV, lopinavir, NFV, nelfinavir, RTV. ritonavir. SQV, 
saqumavir, ATV, atazanavir (EMS 232632). 



-8- 



wo 2004/003514 



PCT/US2003/021335 



5. DETAILED DESattPTlON OF THE INVENTION 

The present invention provides methods and compositions for developing an 
algorithm for determining die effectiveness of anti-viral dmgs based on a comprehensive 
analysis of paired phenotypic and genotypic data guided by phenotypic clinical cut-offi. The 
S present invention also provides methods for determining the susceptibility of a virus to an 
anti-viral treatment, methods for detoinining the effectiveness of an anti-viral treatment of an 
individual mfected with a virus, and methods of monitoring the clinical progression of viral 
infection in individuals receiving anti*viial treatment In another aspect, the present 
invention also provides nucleic adds and polypeptides comprising a mutation in the protease 
10 of a human immunodeficiency virus ("HIV") associated with reduced susceptibility to 
protease inhibitors, e.g., amprenavir. 

5.1 Abbreviatioiis 

"APV** is an abbreviation for the protease inhibitor amprenavir. 
*TS1 is an abbreviation for protease inhibitor. 

15 * yr-R" and "PT-S" are abbreviations for "phenotypically resistant" and 

*^henotypically sensitive,"* respectively. 

"GT-R" and ' 'GT-S" are abbreviations for "genotypically resistant** and 
*'genotypically sensitive,'* respectively. 

* TCR** is an abbreviation for '^lymerase chain reaction.*' 

20 'TE: is an abbreviation for 'Told change.** 

The amino acid notations used herein for the twenty genetically encoded L-amino 
acids are conventional and are as follows: 



Amino Add 


One-Letter 


Three Letter 




Abbrevtadon 


Abbreviation 


Alamne 


A 


Ala 


Ai^^onte 


R 


Aig 


A^saragine 


M 


Asn 


Asparticacid 


D 


Asp 


Cysteine 


C 


Cys 


Glotamine 


Q 


Gin 


Glutamic acid 


E 


Glu 


Glycine 


G 


Gly 
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Amino Acid 



Histidioe 

Isoleucine 

Leucine 

Lysine 

Mediioninc 

Phenylalanine 

Proline 

Smie 

Threonine 

Tryptophan 

Tymmc 

Valine 



One-Letter 
Abbreviation 

H 

I 

L 

K 

M 

F 

P 

S 

T 

W 

Y 

V 



Three Letter 
Abbreviation 

His 

He 

Leu 

Lys 

Met 

Phe 

Pro 

Ser 

Thr 

Tip 

TVr 

Val 



Unless noted otherwise, when polypeptide sequences are presented as a series of one- 
letter and/or three-letter abbreviations, the sequences are presented in the N -> C direction, ia 
accordance with common practice. 

Individual amino adds in a sequence are represented herein as AN, wherein A 
standard one letter symbol for the amino acid in the sequence, and N is the position in the 
sequence. Mutations are represented herein as A1NA2, wherein Ai is the standard one letter 
symbol for the amino acid in the reference protein sequence, A2 is the standard one letter 
symbol for the amino acid in the mutated protem sequence, and N is the position in the amino 
acid sequence. For example, a G25M mutation represents a change from glycine to 
methionine at amino acid position 25. Mutations may also be represented herein as NA2, 
wherein N is the position in the amino acid sequence and A2 is the standard one letter symbol 
for the amino acid in the mutated protein sequence 25M, for a change from the wild- 
type amino acid to methionine at amino acid position 25). Additionally, mutations may also 
15 be represented herein as AiN. wherein A, is the standard one letter symbol for the amino acid 
in the reference protein sequence and N is the position in the amino acid sequence {e,g, , G25 
represents a change from glycine to any amino acid at amino acid position 25). Thisnotation 
is typically used when the toino acid in the mutated protein sequence is either not known or, 
if the amino acid in the mutated protein sequence could be any amino acid, except that found 
in the reference protein sequence. The amino add positions are numbered based on the 
full-length sequence of the protein from which the region encompassing the mutation is 
derived. Representations of nucleotides and point mutations in DNA sequences are 
analogous. 



20 
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The abbreviations used fhroughout (he specification to refer to nucleic acids 
comprising specific nucleobase sequences are the conventional one-letter abbreviations. 
Thus, when included in a nucleic add, the naturally occurring encoding nucleobases are 
abbreviated as fi>llows: adenine (A), guanine (G), cytosine (C), thyniine (T) and uracil (U). 
S Unless specified otherwise, single-stranded nucleic acid sequences that ate represented as a 
series of one-letter abbreviations, and the top strand of double-stranded sequences, are 
presented in the 5* -> 3* direction. 



5.2 Definitions 

As used herein, the following terms shall have the following meanings: 

10 Unless otherwise specified, * *primarv mutation" refers to a mutation that affects the 

enzyme active site, i.e at those amino acid positions that are involved in the enzyme- 
substrate complex, or that reproducibly s^pears in an early round of rq>lication whm a virus 
is sulgect to the selective pressure of an anti-viral agent, or, that has a large efifect on 
phenotypic suscq)t2bility to an anti-viral agent 

1 S ' *Secondarv Mutation" refers to a mutation that is not a primary mutation and that 

contributes to reduced susceptibility or compensates for gross defects imposed by a primary 
mutatioiL 

A ' ^phenotypic assay** is a test that measures the sensitivity of a virus (such as HIV) to 
a specific anti-viral agent. 

20 A "genotvpic assay** is a test that determines a genetic sequence of an organism, a part 

of an organism, a gene or a part of a gene. Such assays are frequently performed in HIV to 
establish whether certain mutations are associated with dmg resistance are present. 

As used hoein, "mnotvpic data** are data about the genotype o^ for example, a virus. 
Examples of genotypic data include, but are not limited to, the nucleotide or amino acid 
25 sequoice of a virus, a part of a vims, a viral gene, a part of a viral gene, or the identity of one 
or more nucleotides or amino acid residues in a viral nucleic acid or protein. 

"Susceptibility ^ refers to a virus' lesponse to a partieiilar dnig, A virus that has 
decreased or reduced suscq>tibility to a drug has an increased resistance or decreased 
sensitivity to the dmg. A virus that has increased or enhanced or greater susceptibility to a 
30 drug has an increased sensitivity or decreased resistance to the drug. 
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Phenotypic susceptibiUty of a virus to a given dnig is a continuum. Nonetheless, it is 
piacticaUy useful to define a threshold or thresholds to simplify interpretation of apaiticular 
fold-change result. For drugs where sufficient clinical outcome data have been gathered, it is 
possible to define a "clinical cutoff value," as below. 

5 "Clinical Cutoff Value" refers to a specific point at which resistance begins and 

sensitivity ends. It is defined by the drug susceptibiUty level at which a patient's piobabiUty 
of treatment failure widi a particular drug significantly mcreases. The cutoff value is 
different for different anti-viral agents, as detemimed in clmical studies. Clinical cutoff 
values are determmed in clinical trials by evahiating resistance and outcomes data. Drag 

10 susceptibility (phenotypic) is measured at treatment initiation. Treatment response, such as 
change in viral load, is monitored at predetermined time points through the course of the 
treatment. The drug susceptibility is correlated with treatment response and the clmical 
cutoff value is determined by resistance levels associated with treatment feilure (statistical 
analysis of overall trial results). 

15 "ISt: refers to bhibitory Concentration. It is the concentration of drag in the patient's 

blood or in vitro needed to suppress the reproduction of a disease-causing microorganism 
(such as HIV) by a%. Thus, "ICar refers to the concentration of an anti-viral agent aft which 
virus replication is inhibited by 50% of the level observed in the absence of the drug. 
•Tatient ICsv refers to the drug concentiation required to inhibit replication of the virus fiom 

20 a patient by 50% and "reference ICjr refers to the drug concentration required to inhibit 
repUcation of a reference or wild-type virus by 50%. Similarly, 'TGac refers to the 
concentration of an anti-viral agent at which 90% of virus replication is mhibited. 

A "fi2ld£S^g£ » a numeric comparison of tiie drag susceptibiUty of a patient virus 
and a drug-sensitive reference vims. It is the ratio of the Patient ICjo to the drug-sensitive 

25 reference IC50, /.c. Patient ICjo/Reference IC50 - Fold Change CTC). A fold change of 1.0 
indicates that the patient virus exhibits the same degree of drug susceptibiUty as the 
drug-sensitive reference virus. A fold change less than 1 indicates the patient virus is more 
sensitive than the drug-sensitive reference virus. A fold change greater than 1 indicates the 
patient virus is less susceptible than the drag-sensitive reference virus. A fold change equal 

30 to or greater than the cUnical cutoff value means the patient virus has a lower probabiUty of 
response to that drug. A fold change less than the clinical cutoff vahie means lbs patient 
virus is sensitive to that drug. 
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"Amprenavir Fold Change" refers to the ratio of the ICso of amprenavir against the 
HIV from the patient plasma saxnph to the IC50 for amprenavir against the NL4-3 (GenBank 
Accession No. AF324493) reference viral straiiL 

A virus is "sensitive" to APV if it has an APV fold change less than 2.5. 

5 A virus is ' *resistantf* to APV if it has an APV fold change of 2.5 or more. 

A virus has an "increased likelihood of having reduced susceptibility^* to an anti-viral 
tceatmmt if the virus has a property^ for exanq>le, a mutation, that is correlated with a 
reduced susceptibility to the anti-viral treatment A property of a virus is correlated with a 
reduced susceptibility if a population of viruses having the property is, on average, less 
10 susceptible to the anti-viral treatment than an otherwise similar population of viruses lacking 
the property. Thus, the correlation between the presence of the property and reduced 
suscq)tibility need not be absolute, nor is there a requirement that the property is necessary 
{Le,, diat the property plays a causal role in reducing susceptibility) or su£Gcient that the 
presence of the property alone is sufiScient) for conferring reduced suscq)tibility. 

15 The term "% sequence homologv" is used interchangeably herein with the terms 

"% homologv. ** "% sequence identitv" and identity*' and refers to the level of amino acid 
sequence identity between two or more peptide sequences, when aligned using a sequence 
alignment pro gr am . For exan^le, as used herein, 80% homology means the same thing as 
80% sequence identity determined by a defined algorithm, and accordingly a homologue of a 

20 given sequence has greater than 80% sequence identity over a length of the given sequence. 
Exenq>lary levels of sequence identity include, but are not limited to, 60, 70, 80, 85, 90, 95, 
98% or more sequence identity to a given sequence. 

Exeiiq>lary conq)uter programs which can be used to determine identity between two 
sequences include, but are not limited to, the suite of BLAST programs, e.g., BLASTN, 

25 BLASTX. and TBLASTX, BLASTP and TBLASTN, publicly available on the hitemet at 
ht4>!//www.ncbi Jibn.nih.gov/BLAST/. See also Altschul et al., 1990, J. MoL Biol 
215:403-10 (with special referssce to the published de&ult setting, i.e., parameters w=^, 
t=17) and Altschul et aL, 1997, Nucleic Acids Res., 25:3389-3402. Sequence searches are 
typically carried out using the BLASTP program when evaluating a given amino acid 

30 sequence relative to amino acid sequences in the GenBank Protein Sequences and other 
public databases. The BLASTX program is preferred for searching nucleic acid sequences 
that have been translated in all reading frames against amino acid sequences in the GenBank 
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Protein Sequences and other pubUc databases. Both BLASTP and BLASTX aie run using 

default parameters of an open gap penalty of 1 1.0, and an extended gq) penalty of 1.0. and 

utilize the BLOSUM-62 matrix. See Altschul. et al., 1997. 

A piefeired alignment of selected sequences in orda to detemiine "% identity" 
5 between two or more sequences, is perfomied using for example, the CLUSTAL-W prognun 

in MacVector version 6.5, operated with default parameters, including an open gap penalty of 

10.0, an extended gap penalty of O.l, and a BLOSUM 30 similarity matrix. 

Amino Acid" refen to 8 hydrophilic amino acid having a side chain tfiat is 

uncharged at physiological pH, but which has at least one bond in which the pair of electrons 
10 shared in common by two atoms is held more closely by one of the atoms. Genetically 

encoded polar amino adds include Asn (N), Ghi (Q) Ser (S) and Thr (T). 

"Nnn pnlar Amino Acid" refers to a hydrophobic amino acid havmg a side chain that 
is uncharged at physiological pH and which has bonds in which the pair of electrons shared in 
common by two atoms is generally held equally by each of the two atoms (ie., the side chain 

15 is not polar). Genetically encoded apohr amino acids include Ala (A), Gly (G), He (I), Leu 
(L).Met(M)andVal(V). 

"Hvdroohilic Arninn Acid" refers to an amino add exhibiting a hydrophobicity of less 
than zero accoiding to the normalized consensus hydrophobicity scale of Ksenberg et al.. 
1984, J. Mol. Biol. 179:125-142. Genetically encoded hydrophilic amino acids include Arg 

20 (R). Asn (N), Asp (D), Glu (E). Gto (Q), His (H), Lys (K). Ser (S) and Thr(T). 

"Hvditmhobic Aminn Add" refeis to an amino acid exhibiting a hydrophobidty of 
greater than zero accoiding to the nomiaHzed consensus hydrophobidty scale of Ksenberg et 
aL, 1984, J. Mol. Biol. 179:125-142. GeneticaUy encoded hydrophobic amino adds include 
Ala (A). Gly (G). lie (I). Leu (L), Met (M). Phe (F), Pro CP). Tip (W), lyr (Y) and Val (V). 

25 "AriHi. . ^ i ^iiift Add" refeis to a hydrophilic amino add having a side chain pK value 

of less than7. Acidic amino acids typically have negatively charged side chains at 
physiological pH due to loss of a hydrogen ion. Genetically encoded acidic amino adds 
include Asp (D) and Glu (E). 

♦ ♦Basic Amino Acid" refers to a hydrophilic amino acid having a side chain pK value 

30 of greater than7. Basic amino acids typically have positively charged side chains at 

physiological pH due to assodation with hydronium ion. Genetically encoded basic amino 
acids include Arg (R), His (H) and Lys (K). 
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A ' imitation" is a change in an amino acid sequence or in a corresponding nucleic 
acid sequence relative to a reference nucleic acid or polypeptide. For embodiments of the 
invention comprising HIV protease or reverse transcriptase, the refermce nucleic acid 
encoding protease or reverse transcriptase is the protease or reverse transcriptase coding 

5 sequence, respectively, present in NL4-3 HIV (GenBank Accession No. AF324493). 

Likewise, the reference protease or reverse transcriptase polypeptide is that encoded by the 
NL4-3 HIV sequence. Although the amino acid sequence of a pqptide can be determined 
directly by, for example, Edman degradation or mass spectroscopy, more typically, the amino 
sequence of a peptide is inferred from die nucleotide sequence of a nucleic add diat encodes 

10 the peptide. Any method for determining the sequmce of a nucleic acid known in the art can 
be used, for example, Maxam-Gilbert sequencing (Maxam et aL, 1980, Methods in 
Enzymology 65:499), dideoxy sequencing (Sanger et al, 1977, Proc. Natl Acad, Set. USA 
74:5463) or hybridization-based ^roaches {see e.g.. Sambrook ei al., 2001, Molecular 
aoning: A Laboraioiy Manual, Cold Spring Harbor Laboratory, 3"* ed., NY; and Ausubel et 

15 a/., 1989, Current Protocob in Molecular Biology, Greene PubKdiing Associates and Wiley 
Interscience, NY). 

A ' distance-associated mutation" Q'RAM'") in a virus is a mutation correlated with 
reduced susceptibility of the virus to anti-viral agents. A RAM can be found in several 
viruses, including, but not limited to a human immunodeficiency virus CmV*"). Such 
20 mutations can be found in one or more of the viral proteins, for example, in the protease, 

integrase, envelope or reverse transcriptase of HIV. A RAM is defined relative to a reference 
strain. For embodiments of the invention comprising HIV protease, the reference protease is 
the protease encoded by NL4-3 HIV (GenBank Accession No. AF324493). 

A * *mutant" is a virus, gene or protein having a sequence that has one or more changes 
25 relative to a reference virus, gene or proteixL 

The tenns ' "peptide. " ^ ^olvpentide" and ' *protem" are used interchangeably 
througliout 

The toms * ^ference" and * *wild-tvpe" are used interchangeably througihout. 

The terms ' "polynucleotide, ** "oliponucleotide" and ' ^nucleic acid" are used 
30 interchangeably throu^out. 
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The present invention provides nucleic acids and polypeptides comprising a mutation 
in the protease of HIV. Preferably, the HIV is human immunodeficiency virus type 1 CHN- 
1"). In one embodunent, the mutation is associated with reduced susceptibility to a protease 
mhibitor. In another embodiment, the mutation is associated with increased susceptibility to 
a protease inhibitor. Hie protease inhibitor can be any protease inhibitor known to one of 
skin in the ait Examples of protease inhibitors include, but are not limited to, sa«iuinavir, 
ritonavir, indinavir, nelfinavir, aaapienavir and lopinavir. fa one embodiment, the protease 
inhibitor is amprenavir. 

fa one aspect, the present mvention provides peptides, polypeptides or proteins 
comprismg a mutation m the protease of HIV associated with either reduced or increased 
susceptibiHty to a protease infaWtor, eg., amprenavir. fa one embodiment, the invention 
provides a polypeptide derived fiom the HIV protease and comprising a mutation associated 
with reduced susceptibiKty to a protease inhibitor, fa another embodiment, the polyp«q>tide 
comprises more than one mutation associated with reduced suscqitibiHty to a protease 
inhibitor, fa another embodiment, the polypeptide comprises a mutation associated with 
mcieased susceptibiUty to a protease inhibitor, fa another embodiment, the polypeptide 
comprises more than one mutation associated with increased susceptflrility to a protease 
inWbitor. Polypeptides of the mvention include peptides, polypeptides and proteins that are 
mbdified or derived fiom these polypeptides, fa one embodiment, the polypeptide comprises 
post-translational modifications, fa another embodiment, the polypeptide comprises one or 

more ammo acid analogs. 

In a preferred embodiment, the polypeptide comprises one or more mutations 

associated with reduced susceptibiKty to amprenavir. Table 1 provides a list of mutations 

associated wifli reduced susceptibility to amprenavir. 

fa another preferred embodiment, the mvention provides a polypeptide derived fiom 

the HIV protease and comprising at least one mutation selected fiom a group of mutations 

consisting of: VllI, VllL, L33F, E34Q, K43T, G48M. I54A. I54S, I54T, Q58E, A71L, 

L76V, P79, V82A, V82F, N83D, I84A, I84C, T91 A, T91S, T91V and C95F, wherem the 

mutation is associated with reduced susceptibility to amprenavir. 
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In another preferred embodiment, the polypeptide comprising said mutation 
comprises at least 10, 15, 20, 25, 30, 35, 40, 45, 50, 60, 70, 80, 85, 90 or 95 contiguous amino 
acids of SEQ ID NO: 1, within which sequence said mutation or mutations can be present. 

in another embodiment, the polypeptide comprising said mutation or mutations is at 
5 least 70%, but less than 100%, identical to a polypeptide having the amino acid sequence of 
SEQ ID NO:l; the polypeptide has an amino acid sequence Ihat is greater than 80% identical 
to the amino acid sequrace of SEQ ID N0:1; or the polypeptide has an amino acid sequence 
that is greater than 90% identical to the amino acid sequence of SEQ ID NO:l; wherem the 
mutation is associated with reduced susceptibility to a protease inhibitor. 

10 In one embodiment, said polypeptide is naturally-occurring. In another embodiment, 

said polypeptide is artificially designed. 

To detennine the percent identity of two amino acid sequences or of two nucleic 
acids, tiie sequences are aligned for optimal comparison purposes (e.g. , gaps can be 
introduced in the sequence of a first amino acid or nucleic acid sequence for optimal 

1 5 alignment with a second anaino or nucleic acid sequence). The amino acid residues or 

nucleotides at corresponding amino acid positions or nucleotide positions are then compared. 
When a position in the first sequence is occupied by the same amino acid residue or 
nucleotide as the corresponding position in the second sequence, thm the molecules are 
identical at that position. The percent identity between the two sequences is a function of the 

20 number of id^cal positions shared by the sequences (% identity = # of identical 

positions/total # of positions ie.g., ov^l^ping positions) x 100). In one embodiment, the 
two sequences are the same length. 

The determination of percent identity between two sequences can be accomplished 
using a mathematical algorithm. A preferred, non-limiting example of a mathematical 

25 algorithm utilized for the comparison of two sequences is the algorithm of Karlin and 

Altschul (1990) Proc. Natl Acad. Sci. USA 87:2264-2268, modified as in Karlin and Altschul 
(1993) Pmc. Natl Acad. Sci. USA 90:5873-5877. Such an algoritfun is incorporated into the 
NBLAST and XBLAST programs of Altschul, et al (1990) J. Mol Biol 215:403-410. 
BLAST nucleotide searches can be performed with the NBLAST program, score - 100, 

30 wordlength = 12 to obtain nucleotide sequences homologous to a nucleic add molecules of 
the invention. BLAST protein searches can be performed with the XBLAST program, score 
50, wordlength 3 to obtain amino acid sequences homologous to a protein molecules of 
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the invention. To obtain gapped alignments for comparison purposes, Gspped BLAST can 
be utilized as described in Altschul et al (1997) Nucleic Acids Res, 25:3389-3402. 
Alternatively, PSI-Blast can be used to peifonn an iterated search that detects distant 
relationships between molecules. Id When utilizing BLAST, G^ed BLAST, and PSI- 
Blast programs, the default parameters of the respective programs (e.g., XBLAST and 
NBLAST) can be used- 5cc http://wwwjicbi.nlmjuh.gov. 

Another preferred, non-limiting example of a nuithematical algorithm utilized for the 
comparison of sequences is the algorithm of Myers and Miller, CABIOS (1989). Such an 
algorithm is incorporated into the AUGN program (version 2.0) Aat is part of the CGC 
sequence alignment software package. When utilizing the ALIGN program for conq)aring 
amino acid sequences, a PAM120 weigjit residue table, a gap length penalty of 12, and a gap 
penalty of 4 can be used. Additional algorithms for sequence analysis are known in the art 
and include ADVANCE and ADAM as desoibed in Torellis and Robotti (1994) Ck>mpuL 
AppL Biasci., 10:3-5; and FASTA described in Pearson and Lipman (1988) Proc Natl Acad, 
Set 55:2444-8. Within FASTA, ktup is a control option that sets the srasitivity and speed of 
the search. If ktup=2, similar regions in the two sequences being compared are found by 
looking at pairs of aUgned residues; if ktup=l, single aUgned amino acids are examined, ktup 
can be set to 2 or 1 for protein sequences, or fiom 1 to 6 for DNA sequences. The defeult if 
ktup is not specified is 2 for proteins and 6 for DNA. 

The percent identity between two sequences can be detOTuined using teclmiques 
similar to ttiose described above, with or without allowing ga|)S. In calculating percent 
identity, typically exact matches are counted. 

In another aspect, the present invention provides polynucleotides, oligonucleotides or 
nucleic acids encoding or relating to a polypeptide of the invention or a biologically active 
portion thcreol^ mcluding, for example, nucleic add molecules suflBcicnt for use as 
hybridization probes, PCR primers or sequencing primOT for identifying, analyzing, mutating 
or amplifying the nucleic acids of the invention. 

In one embodiment, the nucleic acid encodes a polypeptide comprising a mutation in 
the protease of HIV associated with either reduced or increased susceptibility to a protease 
inhibitor, e.g., amprenavir. In one embodiment, the invention provides a nucleic acid 
encoding a polypeptide derived from the HIV protease and comprising one or more mutations 
associated with reduced suscq)tibility to a protease inhibitor. In another embodiment, the 
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nucleic acid encodes a polypeptide ihat comprises one or more mutations associated wifli 
increased suscq>tibility to a protease inhibitor. Nucleic acids of the invention include nucleic 
acids, polynucleotides and oligonucleotides that are modified or derived fiom these nucleic 
acid sequences, hi one embodiment, the nucleic acid comprises nucleic acid analogs. In one 
5 embodiment, the na is naturally-occurring. In another embodiment, said nucleic acid is 
artificially designed. 

The nucleic acid can be any length. The nucleic acid can be, for example, at least 6, 
7, 8, 9, 10, 11. 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28. 29, 30, 31, 32. 
33, 34. 35. 36. 37, 38. 39. 40. 45. 50. 55, 60, 65. 70. 75, 80, 85, 90. 100. 110. 120. 125. 150. 

10 175. 200, 250, 300, 350, 375, 400, 425. 450, 475 or 500 nucleotides in length. The nucleic 
acid can be, for example, less than 3. 4. 5, 6, 7, 8. 9, 10, 1 1, 12, 13. 14, 15, 16, 17, 18, 19, 20, 
21, 22, 23. 24, 25. 26. 27, 28, 29. 30, 31, 32. 33, 34, 35. 36, 37. 38, 39. 40. 45, 50, 55, 60. 65, 
70. 75. 80, 85, 90, 100, 1 10, 120, 125, 150, 175, 200, 250, 300, 350, 375, 400, 425. 450, 475, 
500, 525, 550, 575. 600. 650, 700. 750. 800, 850. 900, 950, 1000. 1100, 1200, 1300. 1400. 

15 1500. 1600. 1700, 1800. 1900. 2000. 2500, 3000. 3500. 4000. 4500, 5000, 5500. 6000. 6500. 
7000, 7500, 8000. 8500, 9000, 9500 or 10000 nucleotides in length. In a piefened 
embodimrat, the nucleic acid has a length and a sequence suitable for detecting a mutation 
described herein. for example, as a probe or a primer. 

In one embodiment, the nucleic acid encodes a polypeptide, or a portion of a 
20 polypeptide, that comprises one or more mutations associated with reduced suscq>tibility to 
anqirenavir. Table 1 provides a list of mutations associated with reduced suscqptibility to 
amprenavir. 

In another embodiment, the invention provides an oligonucleotide encoding a 
polypeptide derived fiom the HIV protease and comprising at least one mutation selected 
25 fiom a groiq> of mutations consisting of: VI II. VI IL. U3F, E34Q. K43T. G48M, I54A. 

I54S, I54T, Q58E. A71L. L76V, P79. V82A. V82F. N83D, I84A, I84C. T91 A, T91S, T91 V 
andC95F. 

In another embodiment, said oligonucleotide conqprising said mutation comprises 15. 
30. 45, 60, 75. 90. 105, 120. 135. 150. 180, 210, 240. 255, 270 or 285 contiguous nucleic 
30 acids of SEQ ID NO: 2. within which sequence said mutation or mutations can be present 

Jn another embodiment, the oligonucleotide comprising said mutation or mutations is 
at least 60%, but less than 100%, identical to an oligonucleotide having the nucleic acid 
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sequence of SEQ K) NO:2; the oligonucleotide has an nucleic acid sequence that is greater 
than 70% identical to the nucleic acid sequence of SEQ ID N0:2; the oligonucleotide has an 
nucleic acid sequence that is greater than 80% identical to the nucleic acid sequence of SEQ 
ID N0:2; or the oligonucleotide has an nucleic acid sequence that is greater than 90% 

5 identical to the nucleic acid sequence of SEQ ID NO:2, wherein the mutation is associated 
with reduced susceptibihty to a protease mhibitor. The percent identity of two nucleic acid 
sequences can be determined as described above. 

In addition to the nucleotide sequence of SEQ ID NO: 2, it wiU be qjpreciated by 
those skilled in the art that DNA sequence polymorphisms fliat lead to changes m the amino 

10 arid sequence may exist within a population ie.g.y the human population). Such genetic 
polymorphisms may exist among mdividuals within a population due to natural allelic 
variation. Natural aUeUc variations can typically result in 1-5% variance in the nucleotide 
sequence of a given gene. Any and all such nucleotide variations and resulting amino acid 
variations or polymorphisms that are the result of natural allelic variation and that do not alter 

1 5 the fimctional activity are intended to be within the scope of the invention. 

In another embodiment, the present invention provides nucleic add molecules that are 
suitable for use as primers or hybridization probes for the detection of nucleic arid sequences 
of the invention. A nucleic acid molecule of the invention can comprise only a portion of a 
nucleic acid sequence encoding a full length polypeptide of tiie invention for example, a 

20 fragment that can be used as a probe or primer or a fragment encoding a biologically active 
portion ofapolypeptide of the invention. The probe can comprise a labeled group attached 
thereto, e.g., a radioisotope, a fluorescent con^wund, an enzyme, or an enzyme co-fector. hi 
various CTibodiments, the nucleic acid molecules of the mvention can be modified at the base 
moiety, sugar moiety or phosphate backbone. 

25 5.4 Finding Drag Resistance-Associated Viral Mntations 

In another aspect, the present invention provides methods for finding resistance- 
associated mutation in a virus or a derivative of the virus. 

5.4.1 The Vtnis and Viral Samples 
A resistance-associated mutatioii ("RAM*") according to the present invention can be 
30 present in any type of virus, for example, any virus found m animals. In one embodiment of 
the invention, the virus includes viruses known to infect mammals, including dogs, cats. 
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horses, sheq), COWS e/c. In a preferred embodiment, the virus is known to In 
an even more pieforred embodiment the vinis is known to infect humans. Examplesof 
human viruses include, but are not limited to, human immunodeficiency virus QTSyT^, 
herpes simplex virus, cytomegalovirus virus, varicella zoster virus, other human herpes 

S viruses, influenza A virus, respiratory syncytial virus, hq>atitis A, B and C viruses, 

rhinovirus; and human papilloma virus, hi a preferred embodiment of the invention, the virus 
is HIV. Preferably, flie virus is human immunodeficiency virus type 1 CWV-l**). The 
foregoing are representative of certain viruses for which there is presoatly available anti-viral 
chemother^y and represent the viral femilies retroviiidae, hetp^viridae» orthomyxoviiidae, 

10 paramxyxovirus, picornavirus, flavivirus, pneumovirus and hqiadnaviridae. This invention 
can be used with other viial infections due to other viruses within these families as well as 
viral infections arising from viruses in other viral femilies for which there is or there is not a 
currently available therapy. 

A RAM according to the present invention can be found in a viral sample obtained by 
1 S any means known in the art for obtaining viral samples. Such methods include, but are not 
United to, obtaining a viral sample from ahuman or an animal infected with the virus or 
obtaining a viral sample from a viral culture. In one embodiment, the viral sanq>le is 
obtained from a human individual mfected with the virus. The viral sanq>le could be 
obtained from any part of the infected individual's body or any secretion eiq>ected to contain 
20 the virus. Examples of such parts include, but are not limited to blood, serum, plasma, 
sputum, lyaq;)hatic fluid, sem^ vaginal mucus and samples of other bodily fluids, hi a 
preferred embodiment, the sample is a blood, serum or plasma sample. 

In another embodiment, a RAM according to the present invention is present m a 
virus diat can be obtained from a culture. In some embodiments, the culture can be obtained 
25 firom a laboratory. In other embodiments, the culture can be obtained from a collection, for 
example, the American Type Culture Collection. 

In certain embodiments, a RAM according to the present invention is presmt in a 
derivative of a virus. In one embodiment, the derivative of the virus is not itself patfiogenic. 
In another embodiment, the derivative of the virus is a plasmid-based system, wherein 
30 rephcation of the plasmid or of a ceU transfected with the plasmid is affected by the presence 
or absence of the selective pressure, such that mutations are selected that increase resistance 
to the selective pressure. In some embodiments, the derivative of the virus coiKq>rises the 
nucleic acids or proteins of interest, for example, those nucleic acids or proteins to be 
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targeted by an anti-viral treatment In one embodiment, the genes of interest can be 
incoipoiated into a vector. See. e.g., U.S. Patent Numbera 5.837.464 and 6.242.187 and PCT 
publication, WO 99/67427, each of which is incoipoiated herein by reference. In a prefeited 
embodiment, the genes can be those that encode for a protease or reverse transcriptase. 

5 In another embodiment, the intact virus need not be used. Instead, a part of the virus 

incorporated into a vector can be used. Preferably that part of the virus is used that is 
targeted by an anti-viral drug. 

In another embodiment, a RAM according to the present invention is present in a 
genetically modified virus. The virus can be genetically modified using any method knovni 

10 in the art for genetically modifying a virus. For example, the virus can be grown for a desired 
number of generations in a laboratory culture. In one embodiment, no selective pressure is 
applied (i.e., die virus is not subjected to a treatment that favors the replication of viruses 
with certain characteristics), and new mutations accumulate through random genetic drift. In 
another embodiment, a selective pressure is applied to the virus as it is grown in culture (i.e., 

15 the virus is grown under conditions that favor the rcpUcation of viruses having one or more 
characteristics). In one embodiment, the selective pressure is an anti-viral treatment Any 
known anti-viral treatment can be used as flie selective pressure. In one embodiment, the 
virus is HIV and the selective pressure is a protease inhibitor. In another embodiment, the 
virus is HIV-l and the selective pressure is a protease inhibitor. Any protease inWbitor can 

20 be used to apply the selective pressure. Examples of protease inhibitors include, but are not 
limited to, saquinavir, ritonavir, indinavir, nelfinavir, amprenavir and lopinavir. In one 
embodiment, the protease inhibitor is selected fix)m a group consisting of saquinavir, 
ritonavir, indinavir, nelfinavir, amprraavir and lopinavir. In another embodiment, flie 
protease inhftitor is amprenavir. By treating HIV cultured in vitro with a protease inhibitor, 

25 e.g., amprenavir, one can select for mutant strains of HIV that have an increased resistance to 
amprenavir. The stringency of the selective pressure can be manipulated to increase or 
decrease the survival of viruses not having flie selected-for characteristic. 

In another aspect, a RAM according to the present invention is made by mutagenizing 
a virus, a viral genome, or a part of a viral genome. Any method of mutagenesis known in 

30 the art can be used fortius purpose. In one embodimait, the mutagenesis is essentiaUy 
random. In another embodiment, the essentiaUy random mutagenesis is performed by 
exposing the virus, viral genome or part of the viral genome to a mutagenic treatment In 
another embodiment, a gene that encodes a viral protein that is the target of an anti-viral 
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therapy is mutagenized. Examples of essentiaUy nmdom mutagenic treatments inclwl^ for 
example, exposure to mutagenic substances {eg,, elhidium bwmide, ethylmethanesulphonate, 
ethyl nitroso urea (EMU) etc.) radiation (e.g.. ultraviolet Ught), the insertion and/or removal 
of transposable elements (eg.. Tn5, TnlO), orreplication in acell, ceU extract, or in vitro 
5 lepHcation system that has an increased rate of mutagenesis. See, e.g., RusseU et al, 1979, 
Proc. Nat Acad Sci. USA 76:5918-5922; Russell, W., 1982, Environmental Mutagens and 
Carcinogens: Pioceedings of the Third hitematiwial Conference on Environmental Mutagens. 
One of skin m the art will appreciate that while each of these methods of mutagenesis is 
essentially random, at a molecular level, each has its own preferred targets. 
10 hi another aspect, a mutation that might affect the sensitivity of a vims to an anti-viral 

therapy is made using sitwfirected mutagenesis. Any method of site-directed mutagenesis 
known m the art can be used {see e.g.. Sambrook et al., 2001, Molecular Cloning: A 
Laboratory Manual, Cold Spring Haibor Laboratory, 3'^ ed., NY; and Ausubel et al., 1989, 
Current Protocols in Molecular Biology, Greene PubUshmg Associates and Wiley 
15 faterecienoe, NY). The site directed mutagenesis can be directed to, e.g., a particular gene or 
genomic region, a particular part of a gene or genomic region, or one or a few particular 
nucleotides within a gene or genomic re^on. hi one embodiment, the site directed 
mutagenesis is directed to a viral genomic region, gene, gene fiagment, or nucleotide based 
on one or more criteria, hi one embodiment, a gene or aportion of a gene is subjected to site- 
20 directed mutagenesis because it encodes a protem that is known or suspected to be a target of 
an anti-viral therapy, e.g., the gene encoding the HIV protease, hi another embodiment, a 
portion of a gene, or one or a few nucleotides within a gene, arc selected for site-dhected 
mutagenesis, hi one embodunent, the nucleotides to be mutagenized encode ammo add 
residues ^ are known or suspected to mteract with an anti-viral compound, hi another 
25 embodunent, the nucleotides to be mutagenized encode amino add residues that are known 
or suspeded to be mutated in viral strains havmg decreased susceptfeiUty to the anti-viral 
treatment In another embodhnent. the mutagenized nucleotides encode anuno add residues 
that are adjacent to or near in the primary sequence of the protem residues known or 
suqiected to mteract with an anti-viial compound or known or suspected to be mutated m 
30 viral strams havmg decreased susceptibiUty to an anti-viral treatment hianotiier 

embodiment, the mutagenized nucleotides oicode ammo add residues diat are adjacait to or 
near to in the secondary, tertiary or quaternary structure of the protein residues known or 
suqiected to interact with an anti-viral compound or known or suspected to be mutated in 
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viral strains having decreased susceptibility to an anti-viral treatment In another 
embodiment, the mutagenized nucleotides encode amino acid residues in or near the active 
site ofa protein that is known or suspected to bind to an anti-viral compound. See,e.g., 
Sarkar and Sommer, 1990, Biotechniques, 8:404-407. 

S 5.4.2 Detecting the Presence or Absence of Mutations in a Viras 

The presnce or absence of a RAM according to the present invention in a virus can 
be detected by any means known in the art for detecting a mutation. The mutation can be 
detected in the viral gene that encodes a particular protein, or m the protein itself, i.e, in the 
amino acid sequence of the protein. 

10 In one embodiment, the mutation is in the viral genome. Such a mutation can be in, 

for example, a gaie encoding a viral protein, in a c£s or tram acting regulatory sequence ofa 
gene encoding a viral protein, an intergenic sequence, or an intron sequence. The mutation 
can affect any aspect of the structure, function, replication or environment of the virus that 
changes its susceptibility to an anti-viral treatment. In one embodiment, the mutation is in a 

15 gene encoding a viral protein that is the target of an anti-viral treatment. 

A mutation within a viral gene can be detected by utilizing a number of techniques. 
Viral DNA or RNA can be used as the starting point for such assay techniques, and may be 
isolated according to standard procedures which are well known to those of skill in the art 

The detection ofa mutation in specific nucleic acid sequences, such as in a particular 
20 region ofa viral gene, can be accomplished by a variety of methods including, but not limited 
to, restriction-fiagment-length-polymorphism detection based on allele-specific restriction- 
endonuclease cleavage (Kan and Dozy, 1978, Lancet li:910-912), mismatch-repair detection 
(Faham and Cox, 1995, Genome Res 5:474-482), binding of MutS protein (Wagner et ah, 
1995, Nud Adds Res 23:3944-3948), dmaturing-gradient gel electrophoresis (Fisher et al^ 
25 1983, Proc. Natl. Acad. ScL USA. 80:1579-83), single-strand-conformation-polymorphism 
detection (Orita et aL, 1983, Genomics 5:874-879), RNAase cleavage at mismatched base- 
pairs (Myers a/.. 1985, Science 230:1242), chemical (Cotton a/., l9SS,Proc, Natl. Acad. 
Sci. USA. 85:4397-4401) or enzymatic (Youil et aL, 1995, Proc^ Natl Acad. ScL USA, 
92:87-91) cleavage of heteroduplex DNA, methods based on oligonucleotide-specific primer 
30 extension (Syvtoen et fl/., 1990, Genomics 8:684-692), genetic bit analysis (Nikiforov et al, 
1994, Nucl Acids Res 22:4167-4175), oligonucleotide-ligation assay (Landegren et al., 1988, 
Science 241:1077), oligonucleotide-specific ligation chain reaction ("LCR*') (Banany, 1991, 
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Proa Natl. Acad Set, USA, 88:189-193), gap-LCR(Abravaya era/., \99S,Nud Acids Res 
23:675-682)^ radioactive or fluorescent DNA sequencing using standard procedures well 
known in the art» and peptide nucleic acid (PNA) assays (Qrum et al^ 1993, NucL Acids Res, 
21:5332-5356; Thiede et al., 1996, NucL Acids Res. 24:983-984). 

5 In addition, viial DNA or RNA may be used in hybridization or anq)Iification assays 

to detect abnormalities involving gene structure, including point mutations, insertions, 
deletions and genomic rearrangements. Such assays may include, but are not limited to. 
Southern analyses (Soutfiem, lSr75, / MoL Biol 98:503-517), single stranded conformational 
polymorphism analyses (SSCP) (Qrita et al, 1989, Proc. Natl Acad. Sd. USA 
10 86:2766-2770), and PGR analyses (U.S. Patent Nos. 4,683,202; 4,683,195; 4,800,159; and 
4,965,188; PGR Strategies, 1995 Lonis et al (eds.). Academic Press, Jnc). 

Such diagnostic methods for the detection of a gene-specific mutation can involve for 
example, contacting and incubating the viral nucleic acids with one or more labeled nucleic 
acid reagents including recombinant DNA molecules, cloned genes or degenerate variants 

1 5 thereof under conditions fovorable for the specific annealing of these reagents to their 

complementary sequences. Preferably, the lengths of these nucleic acid reagents are at least 
IS to 30 nucleotides. After incubation, all non-armealed nucleic acids are removed from the 
nucleic acid molecule hybrid The presence of nucleic acids which have hybridized, if any 
such molecules exist, is then detected. Using such a detection scheme, the nucleic acid fiom 

20 the virus can be immobilized, for example, to a solid support such as a membrane, or a plastic 
sur&ce such as that on a microtiter plate or polystyrene beads. In this case, after incubation, 
non-annealed, labeled nucleic acid reagents of die type described above are easily removed. 
Detection of die remaining, annealed, labeled nucleic acid reagents is accompli^ed using 
standard tedmiques well-known to those in the art. The gene sequences to which the nucleic 

25 add reagents have annealed can be conq>ared to the annealing pattern expected from a 
ncmnal gene sequence in order to determine whether a gene mutation is present. 

Alternative diagnostic methods for the detection of gene specific nucldc add 
molecules may involve their aniplification, e.g:, by PGR (U.S. Patent Nos. 4,683,202; 
4,683,195; 4,800,159; and 4,965,188; PGR Strategies, 1995 hmis et al (eds.). Academic 
30 Press, hic), followed by the detection of the amplified molecules using techniques well 
known to those of skill in the art The resulting amplified sequences can be compared to 
diose which would be expected if the nucldc acid bemg amplified contained only normal 
copies of the respective gene in order to detennine whether a gene mutation exists. 
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Additiomdly^e nucldc arid can be seqiicnced by any sequencing me^^ 
theart. Forexample,(heviialDNAcanbesequencedbythBdideoxymefliodofSangere/ 
aLA^, Proc Natl Acad. ScL t/W 74:5463, as ftither described by Messing c» a/., 1981, . 
Nuc. Acids Res. 9:309. or by the method of Maxam et al., 1980, Methods in Enzymology 
5 65:499. See also the techniques described in Sambrook et at., im. Molecular Qoning: A 
Laboratory Manual, Cold Spring Hariwr Laboratory. 3"* ed.,NY; and Ausubel e/o/.. 1989, 
Current Protocols in Molecular Biology, Greene Publishing Associates and WUey 
Inteisdence, NY. 

Antibodies directed against the viral gene products. i.e. . viral proteins or viral peptide 
10 ftagments can also be used to detect mutations in the viral proteins. Alternatively, the viral 
protein or peptide fragments of interest can be sequencedby any sequencing method known 
intheartinorderto yield the aminoaddsequenceoftheproteinofinterest. Anexamplcof 
such a method is the Edman degtadation method which can be used to sequence small 
proteins or polypeptides. Larger proteins can be initially cleaved by chemical or enzjfmatic 
15 reagents known in the art. for example, cyanogen bromide, hydroxylamine, trypsin or 
chymotrypsin, and then sequenced by the Bdman degradation method. 

5^ )VlM.auring PheF «««vnte Su8ceDtP'«««Y « Mutant Virus 
Any method known in the art can be used to determine the phenotypic susceptibiUty 
of a mutant vims or popuhition of viruses to an anti-viral therapy. See e.g., U.S. Patent Nos. 
20 5,837.464 and 6.242,187, incorporated herein by reference in their entirities. In sbme 

^bodiments a phenotypic analysis is performed, le. , the susceptibility of the virus to a given 
anti-viral agent is assayed with respect to the susceptibility of a reference virus without the 
mutations. This is a direct, quantitative measure of drug susceptibility and can be performed 
by any method known in the ait to detennine the susceptibility of a vims to an anti-viral 
25 agent. An example of such methods includes, but is not linuted to. determining the fold 

change in ICso values with respect to a reference virus. Phenotypic testing measures the 
abiUtyofaspecificviralstraintogrowinvitrointhepresenceofadruginhibitor. Avirusis 

less susceptible to a particular drug when more of the drug is required to inhibit vind activity, 
versus die amount of drug required to inhibit fl»e referaice virus. 

In one embodiment, a phenotypic analysis is performed and used to calculate the IC50 
or IC90 of a drug for a viral stram. The results of the analysis can also be presented as fold- 
change in ICo or IC90 for each viral strain as compared with a drug-susceptible control strain 



30 
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or a prior viral Strain fix)m the same padent Becausethevirusisdirectly exposed to each of 
the available anti-viral medications, lesults can be directly linked to treatment response. For 
example, if the patient virus shows resistance to a particular drug, that drug is avoided or 
omitted fiom the patient's treatment regimen, allowing the physician to design a treatment 

S plan that is more likely to be effective for a longer period of time. 

In another embodiment, the phenotypic analysis is performed using recombinant virus 
assays C*RVAs"). RVAs use virus stocks generated by homologous recombination between 
viral vectors and viral gene sequences, anq>lified firoro the patient virus, hi some 
embodiments, the viral vector is a HIV vector and the viral gene sequences are protease 

10 and/or reverse transcriptase sequences. 

In a prefenred embodiment, the phenotypic analysis is performed using 
PHENOSENSE™ (ViroLogic Inc., South San Francisco, CA). See Petropoulos et aL, 2000, 
Antimicrob. Agents Chemother. 44:920-928; US. Patent Nos. 5,837,464 and 6,242,187. 
PHENOSENSE™ is a phenotypic assay that achieves ttie benefits of phenotypic testing and 

1 5 overcomes the drawbacks of previous assays. Because the assay has been automated, 
PHENOSENSE™ offers higher thn)uglq)ut under controlled conditions. The result is an 
assay that accurately defines the susceptibility profile of a patient's HIV isolates to all 
currently available antiretroviral drugs, and delivers results directly to the physician within 
about 10 to about 15 days of sample receipt. PHENOSENSE™ is accurate and can obtain 

20 results with only one round of viral replication, thereby avoiding selection of sul^pulations 
of virus. The results are quantitative, measuring varying degrees of drug susceptibility, and 
sensitive - the test can be performed on blood specimens with a viral load of about 500 
copies/mL and can detect minority populations of some drug-resistant virus at concentrations 
of 10% or less of total viral population. Furthermore, the results are rq>roducible and can 

25 vary by less than about 1 .4-2.5 fold, depending on the drug, in about 95% of the assays 
performed. 

PHENOSENSE™ can be used with nucleic acids fiom amplified viral gene 
sequences. As discussed in Section 5.4.1, the sample containing the virus may be a sanq;>Ie 
fiom a human or an animal infected with ttie virus or a sample fiom a culture of viral cells. 
30 In one embodiment, the viral sample comprises a genetically modified laboratory strain. 

A resistance test vector ("RTV*) can then be constructed by incorporating the 
an^lified viral gene sequences into a rq}lication defective viral vector by using any method 
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known in the art of incoiporatmg gene sequences into a vector. In one embodiment, 
restrictions enzymes and conventional cloning methods are used. See Sambrook et ai, 2001, 
Molecular aoning: A Laboratory Manual, Cold Spring Haibor Laboratory, 3*^ ed., NY; and 
Ausubel et al., 1989, Current Protocols in Molecular Biology, Greene PubKshing Associates 

5 and Wiley Ihterscience, NY. In a preferred embodiment, Apal and Pin AI restriction enzymes 
are used. Preferably, the repUcation defective viral vector is the indicator gene viral vector 
("IGW"). In a preferred embodiment, the viral vector contains a means for detecting 
replication of the RTV. Preferably, the viral vector contains a hiciferase expression cassette. 
The assay can be performed by first co-transfecting host cells with RTV DNA and a 

10 plasmid that expresses the envelope proteins of another retrovirus, for example, amphotropic 
murine leukemia virus (MLV). Following transfection. virus particles can be harvested and 
used to infect ftesh target cells. The completion of a single round of viral repUcation can be 
detected by the means for detecting rq)lication contained in the vector. In a prefened 
embodiment, the completion of a single round of viral repUcation results in the production of 

15 ludfetase. Serial concentrations of anti-viral agents can be added at either the transfection 

step or the infection step. 

SusceptfeiUty to the anti-viral agent can be measured by comparing the repUcation of 
the vector in the presence and absence of the anti-viral agent For example, susceptibiUty to 
the anti-viral agent can be measured by comparing the luciferase activity in the presence and 
20 absence of the anti-viral agent Susceptible viruses would produce low levels of luciferase 
activity in the presence of antiviral agents, whereas viruses with reduced susceptibiUty would 
produce higher levels of luciferase activity. 

In preferred embodiments, PHENOSENSE™ is used in evaluating the phenotypic 
susceptibiUty of HIV-1 to anti-vind drugs. Preferably, the anti-viral drug is a protease 
25 inhibitor. More preferably, it is amprenavir. La preferred embodiments, the reference viral 
strain is HIV strain NL4-3 or HXB-2. 

In one embodiment viral nucleic acid, for example, HIV-1 RNA is extracted fiom 
plasma samples, and a fragment of, or entire vind genes could be ampUfied by methods such 
as, but not Umited to PCR. See, eg.. Hertogs et al., 1998. Antimicrob Agents Chenwther 
30 42(2):269-76. In one otample, a 2.2-kb fragment containing the mtire HIV-1 PR- and 

RT-ooding sequence is anqiUfied by nested reverse transcription-PCR. The pool of ampUfied 
nucleic acid, for example, ttie PR-RT-codmg sequences, is then cotransfected into a host ceU 
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such as CD4+ T lymphocytes (MT4) with the pGEMT3deltaPRT plasmid fiom which most 
of the PR (codons 10 to 99) and RT (codons 1 to 482) sequences are deleted. Homologous 
recombination leads to the generation of chimeric viruses containing viral coding sequences, 
such as the PR- and RT-coding sequences derived fiom HIV-1 RNA in plasma. The 

5 susceptibilities of the chimeric vimses to all currently available anti-viral ag^ targeting the 
products of the tiansfected geacs iproKT and/or PR inhibitors, for exanq>le), can be 
detennined by any cell viability assay known in the art For example, an MT4 
cell-3-(4,5-dimethyWiiazol-2-yl) -2,5-diphenyltetrazolium bromide-based ceD viability assay 
can be used in an automated system that allows higih sample throughput The profile of 

1 0 resistance to all the anti-viral agents, such as die RT and PR inhibitors can be displayed 
grsphically in a single PR-RT-Antivirogram. 

Other assays for evaluating the phenotypic susceptibility of a virus to anti-viral drugs 
known to one of skill in the art can be used. See, e,g., Shi and Mellors, 1997. Antimicrob 
Agents Chemother. 41(12):2781-85; Gervaix et aL, 1997, Proc Natl Acad Set U. & A, 

15 94(9):4653-8; Race et al, 1999, AIDS 13:2061-2068, incorporated herein by reference in 
their entirities. 

hi another embodiment, the susceptibility of a virus to tr^tment with an anti-viral 
treatment is determined by assaying the activity of the target of the anti-viral treatment in the 
presence of the anti-viral treatment In one embodiment, the virus is HIV, the anti-viral 
20 treatment is a protease inhibitor, and the target of the anti-viral treatment is the HIV protease. 
See, e.g., U. Patent Nos, 5.436,131, 6,103,462, incorporated herein by reference in their 
entirities. 

5.6 Correlating Phenotvpfc and Genotvpic Snsceotibflity 

Any method known in tiie art can be used to determine whetiier a mutation is 

25 correlated with a decrease in susceptibility of a virus to an anti-viral treatment and thus is a 
RAM according to the present invention. In one embodiment, P vahies are used to determine 
the statistical significance of tiie correlation, such that flie smaller the P value, the more 
significant Uie measurement. Prefersibly tiie P values will be less tiian 0.05. More preferably, 
P vahies will be less than 0.0 1 . P values can be calculated by any means known to one of 

30 skill in the art In one embodiment, P values are calculated using Fisher's Exact Test See, 
e.g., David Freedman, Robert Pisani & Roger Purves, 1980, Statistcs, W. W. Norton, New 
York. 
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In a preferred embodiment, numbere of samples with the mutation being analyzed that 
have an ICso fold change below or above 2.5-fold are c6mpared to numbers of samples 
without the mutation. A 2x2 table can be constructed and the P value can be calculated using 
Fisher's Exact Test {see Example 1). P values smaller than 0.05 or 0.01 can be cUissified as 
statistically significant 

5.7 neterminlng Sn«MPtibtUtv to the Antl-Vlral Treatment 

hi another aspect, the present invention provides a method for deteiminmg a virus' 
susceptibiUty to anti-viral treatment Resistance-associated mutations (RAMs) can be 
identified and correlated with reduced susceptibility of a virus to an anti-viral treatment as 
described in Sections 5.3-5.6 above. The presence of a RAM in a virus can be detected by 
any means known in the ait, eg., as discussed in Section 5.4.2 above. The presence of a 

RAM in the virus can indicate that the virus has an increased likelihood of having reduced 
susceptibility for the anti-viral treatment In one embodiment, the virus is human 
immunodeficiency virus (HIV). In another embodiment, the virus is human 
immunodeficiency virus type-1 (HIV-1). to another embodiment, the anti-viral treatment is a 
protease inhibitor. Examples of protease inhibitors include, but are not limited to, saquinavir, 
ritonavir, indinavir, nelfinavir, amprenavir and lopinavir. In one embodiment, the protease 
inhibitor is selected from a group consisting of saqumavir, ritonavir, indinavir, nelfinavir, 
amprenavir and lopinavir. 

bi another embodiment the invention provides a method for determinmg whether a 
HIV has an increased likelihood of having a reduced susceptibility to treatanent with a 
protease inhibitor, comprising detecting in the protease of said HIV the presence or absence 
of a mutation associated with reduced susceptibiUty to treatment with said protease inhibitor 
at amino add position 11, 32, 33, 34, 43, 46, 47, 48, 50, 54, 58, 71, 76. 79, 82, 83. 84. 91 or 
95 of an amino acid sequence of said protease, wherein the presence of said mutation 
indicates that the HIV has an increased likelihood of having reduced susceptibiUty to 
treatment with the protease inhibitor, with the proviso that said mutation is not V32I. M46I. 
M46L, 147V. I50V, I54L, I54M or I84V. 

in another embodiment the invention provides a method of determining whether a 
HIV has an increased Ukelihood of having a reduced susceptibility to treabnent with a 
protease inhibitor, comprising detecting in the protease of said HIV the presence or absence 
of amutotion selected from tiie group consisting of: VllI, VI IL, U3F, E34Q, K43T, G48H 
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I54A, I54S. I54T, Q58E, A71L, L76V, P79, V82A. V82F, N83D, I84A. I84C, T91 A, T91S, 
T91 V and C9SF, vtiierein the mutation is associated widi reduced susceptibility to treatment 
with said protease inhibitor and the presence of said mutation mdicates that the HIV has an 
increased likelihood of having reduced susceptibility to treatment with the protease inhibitor 
5 compared to a HIV without said mutation, a wild type or reference HIV. 

In another aspect, the present invention provides a method for detennining the 
susceptibility ofan individual infected with a virus to anti-viral treatment R^istance- 
associated mutations (RAMs) can be identified and correlated with reduced susceptibility of a 
virus to an anti-viral treatment as described in Sections 5.3-5.6 above. The presence of a 

1 0 RAM in a virus present in a sample from the individual can be detected by any means known 
in the art, e.g., as discussed in Section 5.4.2 above. The presence of a RAM in the virus can 
indicate that the individual has an increased likelihood of having reduced suscq>tibility for 
the anti-viral treatment. In one embodiment, the virus is HIV. In another embodiment, the 
virus is HIV-1 . In another embodiment, the anti-viral treatment is a protease inhibitor. 

1 S Examples of protease inhibitors inchide, but are not limited to, saquinavir, ritonavir, 

indinavir, nelfinavir, amprenavir and lopinavir. In one embodiment, the protease inhibitor is 
selected from a group consisting of saquinavir, ritonavir, indinavir, nelfinavir, amprenavir 
and lopinavir. In another embodiment, the protease inhibitor is amprenavir. 

In another embodiment, the invention provides a method of detennining whether an 
20 individual infected with HIV has an increased likelihood of having a reduced susceptibility to 
treatment with a protease inhibitor, comprising detecting, in a sanqile iBrom said individual, 
the presence or absence of a mutation associated with reduced suscq)tibility to treatment with 
said protease inhibitor at amino acid position 11. 32, 33, 34, 43, 46, 47, 48, 50. 54, 58, 71, 76, 
79, 82, 83, 84, 91 or 95 of die amino acid sequence of die protease of the HIV, wherein the 
25 presence of said mutation indicates that the individual has an increased likelihood of having 
reduced suscqitibility to treatmmt with the protease inhibitor, with the proviso that said 
mutation is not V32I, M46I. M46L, I47V, BOV, I54L, I54M or I84V. 

In anoth» onbodiment, the invention provides a method for detennining the 
effectiveness of protease inhibitor treatment of an individual infected with a HIV, comprising 
30 detecting, in a sample from said individual, the presence or absence of a mutation associated 
with reduced susceptibility to treatment with said protease inhibitor at amino acid position 1 1 , 
32, 33, 34, 43. 46, 47, 48. 50, 54, 58, 71, 76, 79, 82, 83, 84, 91 or 95 of the amino acid 
sequence of the protease of the HIV, wherein the presence of said mutation indicates that the 
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individual has a reduced susceptibiUty to treatment with said protease inhfl)itor, with the 
proviso that said mutation is not V32I. M46I, M46L. MTV. I50V, I54I, I54M or I84V. 

hi another embodiment, the invention provides a method of determining whether an 
individual infected with HIV has an increased likelihood of having a reduced susceptibility to 
treatment with a protease inhfcitor, comprising detecting in the protease of said HIV the 
presence or absence of a mutation associated with reduced susceptibiUty to treatment with 
said protease inhibitor selected fiom the group consisting of: VI II, VI IL. U3F, E34Q. 
K43T. G48M. I54A. I54S, I54T. Q58E. A71L. L76V. P79. V82A. V82F. N83D, 184A. I84C. 
T91 A. T91S, T91V and C95F, wherein the presence of said mutation indicates fliat the 
individual has an increased likelihood of having reduced susceptflrility to treatment with the 
protease inhibitor compared to an individual infected with a HIV without said mutation, eg., 
a wild type OT refermce HIV. 

5^ Constructing an Algorithm 

In one aspect, the present invention provides a method of constructing an algorithm 
that correlates gsnotypic data about a virus with phenotypic data about the virus. Li one 
embodiment, the phenotypic data relate to the susceptibility of the virus to an anti-viral 
treatment hi another embodiment, the anti-viral treatment is an anti-viral compound. In 
another embodiment, the anti-viral compound is a protease inhibitor. In another embodiment, 
tibfr protease inhibitor is amprenavir. 

In one embodunent, the method of constructing the algprithm compiises ^reatmg a 
rule or rules that coirelale genotypic data about a set of viruses with phenotypic data about 
the set of viruses. 

In one embodiment, a data set comprising genotypic and phenotypic data about each 
virus in a set of viruses is assembled. Any method known in the art can be used to coUect 
genotypic data about a virus. Examples of methods of coUecting such data are provided 
above. Any method known in the art can be used for coUecting phenotypic data about a 
virus. Examples ofsuch methods are provided above, hi a preferred embodiment, the data 
set comprises one or more RAMs as described above. In one embodiment, each genotypic 
datum is the sequence of all or part of a viral protein of a vuus in the set of viruses, hi 
another embodiment, each genotypic datum in the data set is a single amino acid change in a 
protein encoded by the vims, relative to a reference protem in the reference virus. In other 
embodiments, the genotype comprises two, three, four. five, six or more ammo acid changes 
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in the viral proteixi. In another einbodiment, the virus is HIV, and the 

protease. In aprefenred embodiment, tbe vims is HIV-1. In another embodiment, the 

reference protein is the protease from NL4-3 HIV. 

In one embodiment, each phenotypic datum in the data set is the susceptibility to an 
S anti-viral treatment ofa virus in the s^ of viruses. In one embodiment, the anti-virai 
treatment is an anti-viral compound hi another embodiment, the anti-viral compound is a 
protease inhibitor. In a preferred embodiment, the protease inhibitor is amprenavir. In one 
embodiment, the susceptibility is measured as a change in the susceptibility of the virus 
relative to a reference virus. In another embodiment, the susceptibility is measured as a 
10 change in the IC50 of &e vims relative to a reference virus. In another embodiment, the 
change in IC50 is represented as the fold-change in ICso. In certain embodiments die virus is 
HTV. In a preferred embodiment, the virus is HIV- 1. In another prefonred embodiment, die 
refermce HIV isNL4-3 HIV. 

The genotypic and phenotypic data in the data set can be represented or organized in 
IS any way known in the art. In one embodiment, the data are displayed in the form of a gr^h. 
In this type of rq)resentation, the y-axis represents the fold change m ICso of a virus in the 
data set relative to a reference virus. Each point on the grsph corresponds to one vims in the 
data set. The x-axis represents the number of mutatioiis that a virus in the data set has. The 
position of the point indicates both the number of mutations and the fold change in anti-viral 
20 therapy treatment that the virus has, both measured relative to a reference strain. In another 
embodiment, the genotypic and phenotypic data in the data set are displayed in the form of a 
chart. 

In one aspect, an algorithm is formulated that correlates the genotypic data with the 
phenotypic data in the data set. In one embodiment, a phenotypic cutoff point is defined. In 

25 a prefmed embodiment, the phenotype is suscq>tibiUty to an anti-viral treatment In another 
embodiment, the phenotype is change in sensitivity to an anti-viral treatment relative to a 
reference virus, and the cutoff point is the value above which a vims or population of viruses 
is defined as phenotypically resistant (TT-R'*) to the anti-viral therqsy and below which a 
virus or population of viruses is defined as phenotypically sensitive ('TT-S") to the anti-viral 

30 therapy. In some embodunents, the cutoff point is 2-fold, 2.S-fold, 3-fold, S-fold, 10-fold, 
IS-fold, 20-fold, 30-fold, 40-fold, 50-fold or 100-fold greater than the IC50 of a reference 
virus. In some embodiments, the phenotypic cutoff point is the clinical cutoff value as 
defined above. In a preferred embodiment, the virus is HIV and the anti-viral therapy is 
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treatmentwithaproteaseinhibitor. fa a preferred embodiment, the protease inhibitor is 
amprenavir. 

In some embodiments, the phenotypic cutoflfpoint is used to define a genotypic cutoff 
point In one embodiment this is done by correlating the number of mutations in a virus of 
the data set with the phenotypic susceptibility of the virus. This can be done, for exanqjle, 
using a graph similar to one discussed above. A genotypic cutoff point is selected such that 
most viruses having more than that number of mutations in the data set are phenotypically 
resistant ("PT-R'O, and most viruses having fewer than that number of mutations are 
phenotypically sensitive CTT-S"). By definition, a virus in the data set with number of 
mutations equal to, or more than the genotypic cutoflfis genotypically resistant CGT-R") to 
the anti-viral treatment, and a vims in the data set with fewer than the genotypic cutoff 
number of mutations is genotypically sensitive CGT-S") to the anti-viral treatment Thus, in 
one embodiment, a genotypic cutoffpoint is selected that produces the greatest percentage of 
viruses in the data set that are either phenotypically resistant and genotypically resistant 
("PT-R, GT-R"), or phenotypically sensitive and genotypically sensitive ("PT-S, GT-S"). 

While this algorithm can provide a usefiil approximation of the relationship between 
the genotypic and phenotypic data in the data set, in most cases there wiU be a significant 
number of strains that are gcnotypicaUy sensitive but phenotypicaUy resistant CGT-S, 
PT-R"). or genotypically resistant but phenotypically sensitive ("GT-R. PT-S"). Thus, in a 
preferred embodiment, the algorithm is finther modified to reduce the percentage of 
discordant results in the data set This is done, for example, by removing firom the data set 
each data point that corresponds to a virus population comprising a mixture of mutations 
inchidingthewUd-typcatasinglepositionconsideredbythealgoridmitested. Thiscan 

have the effect of reducing the number of PT-S. GT-R results, thus lowering the overall 
percentage of discordant results and so improves the fit of the algorithm to a data set 
In another embodunent differential weight values are assigned to one or more 
mutationsobservedinthedataset. An algorithm that does not include this step assumes that 
each mutation in the data set contributes equally to the overaU resistance of a virus or 
population of viruses to an anti-viral therapy. For example a mutation could be present in a 
data set that is ahnost always correlated with phenotypic resistance to an anti-viral treatment 
TTiat is. ahnost every virus that has the mutation is phenotypically resistant to the anti-viral 
treatment, even those strains having only one or two total mutations, fa one embodiment, 
such mutations are "weighted," i.e., assigned an increased mutation score. A mutation can be 
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assigned a weight of, for example, two, three, four, five, six, seven, eigjit or more. For 
example, a mutation assigned a weight of 2 will be counted as two mutations in a virus. 
Fractional weighting values can also be assigned. In another embodiment, values of less than 
1» and of less flian zero, can be assigned, wherein a mutation is associated with an increased 
5 sensitivity of the virus to the anti-viral treatment 

One of skill in the art wiU sq)preciate that there is a tradeoff involved in assignmg an 
inoeased weigiht to certain mutations. As the weight of die mutation is increased, the number 
ofGT-R,PT^ discordant results may increase. Thus, assigning a weight to a mutation that 
is too great may increase the overall discordance of die algorithm. Accordingly, in one 
1 0 embodiment, a weight is assigned to a mutation that balances the reduction in GT-S, PT-R 
disrordant results with the increase in GT-R, FT-S discordant results. 

In anodier embodiment, the interaction of different mutations in the data set with each 
other is also &ctored into the algorithm. For example, it might be found that two or more 
mutations behave synergistically, that the coincidence of the mutations in a virus 

1 S contributes more significantly to the resistance of the virus than would be predicted based on 
the effect of each mutation independent of the other. Altematively, it might be found that the 
coincidence of two or more mutations in a virus contributes less significantly to the resistance 
of the virus than would be expected fiom the contributions made to resistance by each 
mutation when it occurs independently. Also, two or more mutations may be found to occur 

20 more fiiequently togedier than as indq>endent mutations. Thus, in one embodiment, 

mutations occurring togedier are weighted together. For sample, only one of the mutations 
is assigned a weight of 1 or greater, and the other mutation or mutations are assigned a weight 
of zero, in order to avoid an incr^se in the number of GT*R, PT-S discordant results. 

In another aspect, the pbenotypic cutoff point can be used to define a gcnotypic cutoff 
25 point by correlating the number as well as the class of mutations in a virus of the data set with 
the pbenotypic susceptibility of die virus. Exanq>les of classes of mutations include, but are 
not limited to, primary amino acid mutations, secondary amino acid mutations, mutations in 
which the net charge on the polypeptide is conserved and mutations that do not alter the 
polarity, hydrophobidty or hydrophilicity of the amino acid at a particular position. Other 
30 classes of mutations that are within the scope of the invention would be evident to one of skill 
in the art, based on the teachings herein. 
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In one embodiment, an algorithm is constructed that factors in the requirement for one 
or more classes of mutations. In another embodiment, the algorithm fectors in the 
requirement for a minimum number of one or more classes of mutations. In another 
embodiment, the algprithm fectors in the requirement for a minimum number of primary or 
secondary mutations. In another embodiment, the requirement for a primary or a secondary 
mutation in combination with other mutations is also fectored into the algorithm. For 
example, it mi^it be found that a virus widi a particular combination of mutations is resistant 
to an anti-viral treatment, whereas a virus with any mutation in that combination, alone or 
with other mutations that are not part of the combination, is not resistant to the anti-viral 
treatment. 

By using, for example, the methods discussed above, the algorithm can be designed to 
achieve any desired result. In one embodiment, the algorithm is designed to maximize the 
overall concordance (the sum of the percentages of the PT-R. GT-R and the PT-S, GT-S 
groups, or 1 00 minus (percentage of the PT-S, GT-R + PT-R, GT-S groups). In preferred 
embodiments, the overaU concordance is greater dian about 75%. 80%. 85%, 90% or 95% 
In another embodiment, the algprithm is designed to minimize the percentage of PT-R, GT-S 
results. In another embodiment, the algprithm is designed to minimize the percentage of PT- 
S. GT-R results, to another embodiment, the algorithm is designed to maximize the 
percentage of PT-S. GT-S results, to another embodiment, the algorithm is designed to 
nuaimize the percentage of PT-R, GT-R results. 

At any pomt during the construction of the algorithm, or after it is constnicted, it can 
• be farther tested on a second data set. to one embodiment, the second data set consists of 
viruses that are not included in the data set used to construct the algorithm, i.c.. the second 
data set is a naive data set to another embodiment, the second data set contains one or more 
viruses that were m the data set used to constnict the algorithm and one or more viruses that 
were not m that data set Use of the algorithm on a second data set, particularly a naive data 
set, allows the predictive capability of the algprithm to be assessed. Tlius, m one 
embodiment, the accuracy of an algprithm is assessed using a second data set, and the rules of 
the algorithm are modified as described above to improve its accuracy, to a preferred 
embodiment, an iterative approach is used to create the algorithm, whereby an algprithm is 
tested and then modified repeatedly untU a desired level of accuracy is achieved. 

to one aspect, the constmction or implementation of the algorithm can begm with a 
few "starting mutations" and proceed m steps in which it factors in the presence of certain 
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mutations or classes of tnutations. In one embodiment, die algorifhm factois in the presence 
of either ISOV on its own or, any one or more of V32I, IS4L or IS4A or V plus two 
secondary mutations. Any ofthe secondary mutations listed in Table 5 can be used. Next, 
the algorithm factors in oAer mutations in addition to the starting mutations. The additional 

S mutations can include, e.g., 82F and I84C as well as any one or more of S4A, 54S or 54T. In 
one embodiment, the algorithm, in all future stages, factois in a minimum number of 
secondary mutations. In a more particular embodiment, the algorittm, in all fiiture stages, 
factors in at least 2 secondary mutations. The algorithm can then factor in the presence of 
additional mutations, e,g,, the combination of 33F and 82A. When the algorithm factors in 

10 the combination of 2 or more mutations, it is g^erally understood that both mutations, eg., 
33F and 82A, be present in the same virus (or sample). Finally, the algorithm can factor in 
additional combiiuitions, eg., the combination of 461 or 46L with any one or more of 47V, 
S4V, 7\L, 76V, or 82A. During fbt construction or implementation of an algorithm as 
described above, a decrease in the overall discordance as well as the percentage of data in the 

1 5 PT-R, GT-S group decreased with each step of the algorithm is indicative that the algorithm 
improved each time in correctly predicting the mutations and combinations of mutations that 
led to phenotypic resistance. 

5.9 Using an Algorithm to Predict the Snscentibilltv of a Vims 

In another aspect, the preset invention also provides a method for using an algorithm 

20 of the invention to predict the phenotypic susceptibility of a virus or a derivative, of a virus to 
an anti-viral treatment based on the genotype of the virus. In one embodiment, the method 
comprises detecting, in the virus or derivative of the virus, the presence or absence of one or ' 
more RAMs, qiplying the rules of the algorithm to the detected RAMs, wherein a virus that 
satisfies the rules of the algorithm is genotypically resistant to the anti-viral treatmmt, and a 

25 virus that does not satisfy the rules of the algorithm is genotypically sensitive to the anti-viral 
treatmmt. In anotho- raibodiment, the method comprises detecting, in the virus or derivative 
of the virus, the presence or absence of one or more RAMs, allying the rules of the 
algorithm to the detected RAMs, wherein a score equal to, or greater than flie genotypic 
cutofT score indicates that the vims is genotypically resistant to the anti-viral treatment, and a 

30 score 1^ than the genotypic cutoff score indicates that the virus is genotypically sensitive to 
the anti-viral treatment 
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The algorithm of this invaitioii can be used for any viral disease where anti-viral drag 
susceptibittty is a concern, as discussed above in Section 5.4.1. In certain embodiments the 
assay of the invention can be used to determine the susceptibility of a retrovirus to an anti- 
viral drug. In a preferred embodiment, the retrovirus is HIV. Preferably, the virus is HIV-1. 

5 The anti-viral agent of the invention could be any treatment effective against a virus. 

It is useful to the practice of this invention, for example, to understand the structure, life cycle 
and genetic elements of the viruses which can be tested in the drug susceptiWUty test of this 
invention. These would be known to one of ordmary skiU in the art and provide, for example, 
key enzymes and other molecules at which the anti-viral agent can be targeted. Examples of 

10 anti-viral agents of the invention include, but are not limited to, nucleoside reverse 
transcriptase inhibitors such as AZT, ddl, ddC, d4T, 3TC, abacavir. nucleotide reverse 
transcr^tase inhibitors such as tenofovir. non-nucleoside reverse transcriptase inhibitors such 
as nevirapine, efevirenz, delavirdine, fusion inhibitors such as T-20 and T-1249 and protease 
inhibitors such as saquinavir, ritonavir, indinavir, nelfinavir, amprenavir and lopinavir. 

1 5 In some embodiments of the invention, the anti-viral agents are directed at 

retroviruses. In certain embodiments, the anti-viral agents are protease inhibitors such as 
saquinavir, ritonavir, indinavir, nelfinavir. amprenavir and lopinavir. In a preferred 
embodiment, ibs anti-viral agent is amprenavir. 

Some mutations associated with reduced susceptibiUty to treatment with an anti-viral 

20 agent are known in the art. &e. eg., Maguire et al., 2002, Antimicrob Agents Chemother 
46:731-738. Others can be determined by methods described in Sections 5.4-5.8 above. For 
example. Table 1 provides a list of mutations associated with reduced susceptibiUty to 
amprenavir. 

5.10 Using an Algorithm to Predict tiie Effectiveness of 
25 And-Vlral Treatment for aw Individual 

In another aspect, the present invention also provides a method for using an algorithm 

of the invention to predict the effectiveness of an anti-viral treatment for an individual 

infected with a virus based on the genotype of the virus to the anti-viral treatment In one 

embodiment, the method comprises detecting, in the virus or derivative of the virus, the 

30 presence or absence of one or more RAMs, applying the rules of the algorithm to the detected 

RAMs, wherein a virus that satisfies the rules of the algprithm is genotypically resistant to the 

anti-viral treatment, and a virus that does not satisfy the rules of the algorithm is 
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genotypically sensitive to the anti-viral treatment In another embodiment, the method 
comprises detecting, in the virus or a derivative of the virus, the presence or absence of one 
or more RAMs, applying the rules of the algorithm to the detected RAMs, wherein a score 
equal to, or greater than the goiotypic cutoff score indicates that the virus is genotypically 
5 resistant to the anti-viral treatment, and a score less than the genotypic cutoff scoie indicates 
that the virus is gmotypically sensitive to die anti-viral treatment. 

As described in Section 5.4.1 above, the algorithm of die invention can be used for 
any viral disease where anti-viral dmg susceptibility is a concern and the anti-viral agent of 
the invention could be any treatment effective against a virus. In certam embodiments the 

10 assay of the invention is used to determine the ^cq)tibility of a retrovirus to an anti-viral 
drag. In a preferred embodimrat, the retrovirus is HIV, Preferably, the virus is HIV-1. In 
some embodiments of the invention, the anti-viral agents are directed at retroviruses. In 
certain embodiments, the anti-viral agents are pn>tease iiihibitoTS such as saqpiinav^^ 
ritonavir, indinavir, nelfinavir, amprenavir and lopinavir. In a prefened embodiment, the 

1 5 anti-viral agent is amprenavir. 

As described in Section 5.9 above, mutations associated with reduced suscq>tibility to 
treatment with an anti-viral agent may be obtained firom the art or detennined by methods 
described above in Sections 5.4 - 5.8. 

In some embodiments, the present invention provides a method for monitoring the 
20 effectiveness of an anti-viral treatment in an individual infected with a virus and^undergoing 
or having imdergone prior treatment with the same or different anti-viral treatment, 
comprising, detecting, in a sample of said individual, die pres«ce or absence of an amino 
acid residue associated with reduced susceptibiUty to treatment the anti-viral treatment, 
wherein the presence of the residue correlates with a reduced susceptibility to treatment with 
25 the anti-viral treatment. 

5.11 Correlating Susceptibility to one Anti-Viral Treatment 
with Snsceptibllitv to Anodicr Anti-Viral Treatment 

In another aspect, the present invention provides a method for using an algorithm of 
the invention to predict the effectiveness of an anti-viral treatment against a virus based on 
30 the genotypic susceptibility ofthe virus to a different anti-viral treatment In one 

embodiment, the method comprises detecting, in a virus or a derivative of a virus, the 
presence or absence of one or more mutations correlated with resistance to an anti-viral 
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tieatment and applying the rules of an algorithm of the invention to the detected mutations, 
wherein a vires that satisfies the reles of the algorithm is genotypicaUy resistant to the anti- 
viral treatment, and a virus that does not satisfy the rales of the algorithm is genotypicaUy 
sensitive to the anti-viial treatment In another embodiment, the melhod comprises detecting, 

5 in the virus or a derivative of the virus, the presence or absence of one or more mutations 
correlated with resistance to an anti-vind treatment and applying the rules of the algorithm to 
the detected mutations, wherein a score equal to. or greater than the genotypic cutoflf score 
indicates that th^ virus is genotypicaUy resistant to a different anti-viral treatment, and a score 
less than the genotypic cutoif score indicates that the virus is genotypicaUy sensitive to a 

10 diffeitmt anti-viral treatment. In another embodiment, the two anti-viial treatments aflfect the 
same viral protein. In anoflier embodiment, the two anti-viral treatments are both protease 
inhibitors. Examples of protease inhibitors include, but are not Umited to, saquinavir, 
ritonavir, indinavir, nelfinavir. amprenavir and lopinavir. In another embodiment, one of the 
two anti-vind treatments is amprenavir. In another embodiment, a mutation correlated with 

15 resistance to one protease inhibitor is also correlated with resistance to another protease 
inhibitor. 

6. EXAMPLES 

The following examples are provided to iUustrate certain aspects of the present 
invention and not intended as Umiting the subject matter thereof! 

20 6.1 Example 1: Analysis ofPatleBt Samples to Identlly 
i»«.i«tmiei>.AMociated M utations 

This example demonstrates a method of analyzing patient samples so as to identify 
mutations that are associated either with increased or with decreased susceptibiUty to protease 
inhibitors sudi as amprenavir. 
25 In order to determine the retationship between an HIV-1 strain's protease sequence 

and its susceptfeiUfy to treatment with amprenavir. a data set of 10,5 13 samples was used. 
From this data set. dl -VUd type" samples, samples with FC < 2 for aU protease 

inhibitors and no drug selected mutation were removed. Figure 2 shows a flow-chart 
depicting the steps followed to arrive at a final set of samples that was analyzed in detail. 
30 nie drug-selected PI mutations used as a criterion for removing a sample were those that 
occurred at amino add positions 23, 24. 30, 32. 33F. 46. 48, 50, 53, 54. 82 (except 82D. 84, 
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88 or 90 of the HIV protease (Figure 2). Next, samples with the same gmotype were 
removed fiom the data set This resulted in a data set of4414 patient plasma samples that 
was analyzed genotypically as well as phenotypicaUy. 

The phenotypic assay was conducted using the PHENOSENSE™ (Virologic, South 
5 San Francisco, CA) HIV assay (Petropoulos et al.^ 2000, Antimicrob. Agents Chemother. 
44:920-928; U.S. Patent Nos. 5,837,464 and 6,242,187). ICso values for ampienavir were 
obtained for the HIV-1 fiom the patient sample. This was compared to die ICso for 
anqxrenavir against the NL4-3 (GenBank Accession No. AF324493) reference viral strain. 
Phenotypic data were expressed as '*foId change" (or log fold change) m 50% inhibitoiy 

10 concentration ( IC50) of amprenavir. The fold IC50 values were calculated by dividing the 
IC50 of ampr^vir against the HIV-1 fit>m the patient plasma sample by the ICso for 
amprenavir against the NL4-3 (GenBank Accession No. AF324493) reference viral strain. 
The cut-off used to define phenotypic resistance to APV was 2.S-fold. This cut-off value has 
not been derived from clinical outcome studies, however the 2.S-fold threshold is meaningful 

1 S because: strong correlations exist between phenotypic results using this cutoff and clinical 
response data in cohorts that included patients receiving amprenavir (Haubrich et ai, 2001, 
Antivir Ther 6 (suppl 1):63; Katzenstein et al., 2002, 9th CROI, Seattle, WA); reductions in 
susceptibility to amprenavir observed in patients who experienced viral load rebound while 
using amprenavir as their first PI are modest (as low as 2-3 fold) (Maguke et al^ 2002, 

20 Antimicrob Agents Chemother 46:73 1-738); and the 99di percentile for the distribution of 
amprenavir FC in genotypically wild-type viruses using the PhenoSense™ Assay was 
2.1-fold. For those mutants that had an amprenavir FC >2.S, %R and %S values were 
calcidated according to the formulae: 

%R - (number of samples with mutant that were PT-R) / (total no. of PT-R samples) 
25 %S - (numbs of samples with mutant that were PT-S) / (total no. of PT-S sanq)les) 

hi order to define the genotypic changes correlated wi& reduced suscq>tibility to 
amprenavir^ the entire amino acid sequences of HIV proteases in each of the patients' 
sanq>les were analyzed. Mutations were compared to the protease sequence of the NL4-3 
30 (GenBank Accession No. AF324493) reference strain. All positions that were mutated in at 
least 1% of the 4414 samples (i.e., in at least 44 samples) were analyzed in detail. Mixtures 
yreie counted as mutants. In some cases, different amino adds at the same position were 
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grouped togeflier (e.g., position 67), whereas at other positions (e.g., position 82), the 
different nnitations (e.g., V82A, F, S or T) were counted separately. 

P values were calculated for detcnnining the statistical significance of the phmotypic 
and genotypic correlations. For each mutation the number of sanq)les in the data set that had 

5 an APV FC < 2.5 or an APV FC > 2.5 were conq)ared in samples with or without ttie 
mutation in question. A 2x2 table was constructed and the P value was calculated using 
Fisher's Exact test Mutations that had a P < 0.001 and %R/%S > 3 were considered to be 
associated with reduced susceptibility to amprenavir, while mutations that had a P < 0.001 
and a %R/%S < 0.3 were craiaidered to be associated with increased suscqitibility to 

10 amprenavir. Other values for P, e.g., P < 0.05 and %R/%S ratios, e.g., %R/%S > 4 or 5 and 
%R/%S < 0.25 or 0.2, which are witiim the scope of the mvention, will be evident to one of 
skiU in tiie art based on the teachings herein. Table 2 provides the entire Mst of mutations that 
were analyzed and Table 1 provides a list of mutations that were associated with either 
reduced or increased susceptibiKty to amprenavir {Le., those mutations witii P < 0.001 and 

15 %R/%S>3or%R/%S<0.3). 

6J Example 2: Correlation of Amprenavir Susceptibility 
to the Mntatioaa in HIV Protease 

This exanqile demonstrates tiie construction of an algoritimi fliat correlates the 
mutations in flie protease gene of an HIV with its suscq)tibility to an^renavfar. 

20 A data srt of 4414 patient plasma samples was analyzed and mutations associated 

with reduced susceptibility to ampraiavir were identified, as described in Example I. The 
phenotypic susceptibility to amprenavir (amprenavh: fold change) was analyzed as a function 
of the number of mutations in the protease of the HIV present in a patient's plasma sample. 
The fold change for each sample was calculated by dividing the IC50 of amprenavir against 

25 the HIV fiom the patient's plasma sample by the IC50 for amprenavir against the NL4-3 

(GenBank Accession No. AF324493) reference viral strain. The genotype data was obtained 
by sequencing the protease of the HIV present in each patient's sample and detennining the 
sequence changes with respect to the sequence of the NL4-3 (GenBank Accession No. 
AF324493) HIV. The amino sequence for the NL4-3 protease is provided in SBQ. ID. No. 1 

30 (Figure 3A) and the nucleic acid sequence for the NL4-3 protease gene is provided in SEQ. 
ID. No. 2 (Figure 3B). 
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The mutations used in a prelinunaiy round of analysis were the primary ampreoavir 
mutations I54M, I50V, V32I, IS4L, 184V (identified by Maguire et al, 2002, Antimicroh 
Agents Chemother 46:731-738) and I84A. Sanq>les were defined as genotypically resistant 
("GT-R'O if any of the above mutations (V32I, I50V, I54L or M, or I84A or V) were present 
S Data were analyzed using univariate analysis (P<0.001 by Fisher's exact test was considered 
significant) and regression tree (CART) analysis (Statview 5.0 software; SAS, Gary, NC). 

The results of dus analysis were divided into four groups. Viruses that contain none 
of flie above-identified mutations in their protease and are phenotypically and genotypically 
sensitive (PT-S, GT-S) to aiiq>renavir were present in 1635, or 37% of the 4414 samples. 

10 Viruses that contain at least one of flie abovMdentified mutations and are phenotypically and 
genotypically resistant (PT-R, (jT-R) to anqirenavir (APV FC >2.5; Log anq)renavir fold 
change > 0.398) were present in 1698, or 38.5% of the samples. The other two groups 
corr^pond to the "excq>tioiis" where a virus was predicted based on genotype (number of 
mutations) to be susceptible, but was phenotypically (based on Log amprenavir fold change) 

IS resistant (PT-R, GT-S) or where a virus was predicted based on genotype to be resistant, but 
was phenotypically (based on Log anqirenavir fold change) susceptible (PT-S, GT-R). 

Nearly one quarter of the initial gmotypic interpretations were discordant with the 
observed phenotypic results. 865 samples, corresponding to 1 9.6% of die samples lacked 
any of the above-identified mutations, but, contrary to expectations, wm found to be 
20 phenotypically resistant to amprenavir (PT*R, GT-S). Conversely, some viruses that had 

one or more of the above-identified mutations did not exhibit any greater resistance to 
anq>renavir than did the WT strain (PT-S, GT-R) (216 samples (4.9%)). 

6.3 Example 3; Analysis of the PT-R, GT-S Discordant Group 
This example dononstrates that certain mutations and certain combinations of 
25 mutations make a greater contribution to amprenavir resistance than others. 

The samples in the PT-R, GT-S groiq> of Example 2 correspond to viruses with no 
Imown primary mutations in the HIV protease associated with reduced suscq>tibility to 
anqnenavir (te, none of V32I, I50V, I54L or M, or I84A or V). These viruses were 
phenotypically resistant (had an anq>renavir fold change greater than 2.5) but were predicted 
30 to be genotypically sensitive (because they had none of the known primary mutations). Since 
the majority of die discordance for anq)renavir is of the PT-R» GT-S type, the development of 
the algorithm focused on mutations which were associated with this phenotype, and not those 
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lhatdinplyappeaiedalongwithaknownmutatioa Therefore, samples with known 
inutations(Gr-R)wei«iemoved. This resulted in a total of 2499 lemaiiring samples. 34%of 
diese had APV FC > 2.5 (Figure 4). 

CART (Classification and Regression Tree) analysis enabled the identification of the 

5 most important variables for defining APV reduced susceptibility. Figure 4 shows the tree 
generated by die analysis. The analysis resulted in the identification of a list of mutations 
that contribute to reduced susceptibiUty to amprenavir VllI, L, V32I, U3F, E34Q. K43T, 
MTV. G48M. KOV. I54M. I54S, I54A. I54L» I54T. A71L, L7iSV. V82F. I84V and T91 . 
Further, the analysis also enabled the identification of some mutations that were not 

10 independenfly associated with reduced susceptibility to amprenavir, but were likely to be 

associated with reduced susceptibiUty to amprenavir in combination with other mutations. 
This list included: LIOF. LI 01. L24I. E35. M46I. M46L, G48V. F53L. I54V. Q58E. C67. 
A71V, G73, V82S. V82A, I84A and L90M. AU possible pairs of these mutations were tested 
and Fisher's Exact test performed, as described above. The results are summarized in Table 
15 3 as weU as in Figure 5. Figure 5 is a matrix of pairs of mutations associated with resistance 
toamprenavir. The numbers in each cell are the odds ratio (%R:%S) for tiiat pair of 

mutations and the numbers in brackets indicate the number of samples with that pair. Only 
those cells for wWch the corresponding pair hadP<0.001 (as detemained by Fisher'sexact 

test) have numbers in them. 
20 Pairs ofthe above mutations with strong associations with reduced susceptibiUty to 

amprenavir were tested again by CART analysis as shown in Figure 6. 

Thus, it is evident that the PT-R, GTS group can be associated with flie presence of 
mutations that are not independenfly associated with reduced susceptibiUty to amprenavir. but 
contribute to reduced susceptibiUty to amprenavir m combination with other mutations. 

25 6.4 FMrnnle 4! AifTrithm and Bemonstration oflts Accnracy 

nris example demonstrates the constniction of an algorithm tiiat reduces the 
incidence of FT-R. GT-S results by requiring certain mutations, classes of mutations and 
combinations of mutations. 

As described in Example 3. fiom a starting data set of 4414 samples, those samples 
30 witiiAPVGT.Rwcreremoved.resultinginadatasetof2499samples. The final rules were 
formulated based on the results observed with both data sets, those with 4414 and 2499 
samples (these data were called the •draining date"). The accuracy ofthe devised rules or 
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algoiitbm was evaluated based on the accuracy with v/bi<ii the susceptibility of the viruses 
could be determined based solely on the al^rilhm. When discrepancies were seen between 
the predictions and the observed results, the algorithm was modified so that it remained 
consistent with the observed results. The rules devised from the training data were then 

5 tested on a 'Validation data set" of patients. The validation data set was obtained by starting 
wiAadatasetof ll»768saii9les. From this data set, following the st^s described in 
Exaniple 1, all saxxq>les with FC< 2 for aU protease inhibitors and no drug selected mutation 
were removed. Next, sanqiles with the same genotype were removed. Finally samples that 
were present in the training data set were excluded. This resulted in a validation data set of 

10 1634 sanqples that was used to test the accuracy of the algorithm derived using only the 
training data set 

Table 4 provides a summary of die rules qyplied at each round or version of the 
development of the algorithm and the results obtained for the training data set with 4414 
samples and the validation data set with 1634 samples. The first column provides the rules 
1 5 used for each round of testing. The rules are cmnulative, i.e.» the rules for each round is 
added to the rules for the round preceding it 

The next four colunms provide, in order, the number of samples in the PT-S, GT-S, 
thePT-R,GT-R,theFT.R,GT-SandthePT-S,GT-Rgroi?)s. The number in tfxe PT-S, 
GT-R column excludes the PT-S, GT-R samples associated with mixtures at a primary or at 
20 least two secondary positions. Because mixtures were counted as mutants, a sample wiA say, 
less than 50% mutant, would be counted as a mutant for genotypic purposes, even though its 
phenotypic resistance may not be as high as a true mutant This gives rise to more samples in 
the PT-S, GT-R group, than expected based on the rules alone. 

The next 3 columns of Table 4 provide, in order, the percentage of sanq)les that are in 
25 the PT-R, GT-S and the PT-S, GT-R groups and flie overall discordance (the sum of the 

percentages of the PT-R, GT-S and the PT-S. GT-R groiq)s, or 100 - (percentage of the PT-S, 
GT-S + PT-R, GT-R groups). 

The al^rithm began with requiring the ''starting mutations,*' ^ch refers to either 
I50V on its own or, any one or more of V32I, I54L or M, I84A or V plus two secondary 
30 mutations. Any of the secondary mutations listed in Table 5 can be used. In the next round, 
82F and I84C as well as any one or more of 54A, 54S or 54T were added. From this round 
on, at least 2 secondary mutations ("2mut'* in Table 4) were needed. In the next round the 
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combination of 33F and 82A was added. This required that both, 33F and 82A be present in 
the same virus (or sample). In the last round, the combination of 461 or 46L with any one or 
more of 47V, 54V, 71L, 76V, or 82A was added. 

The ovoall discordance as well as the percentage of data in the PT-R, GT-S groiq) 
5 decreased with each round ofthe algorithm, indicating that the algorithm improved each time 
in correctly predicting the mutations and combinations of mutations that led to phenotypic 
resistance. For the training daU set, Ae total discordance decreased significantly, ftom 
24.5% in the first round to 14.7% in the last round and the number of samples in the PT-R, 
GT-S ffovp cone^dingly decreased fiwm 19.6% to 5.9% of the total 8anq>le8. 
10 The validation data set had a lower starting total discordance (15.8%) and fewer 

samples in the PT-R. GT-S group (12.8%) as compared to the training data set. Without 
being bound by theory, it is beKeved that this was because the vaUdation data set contained 
fewer samples with complex viral protease genotypes. The accuracy ofthe algorithm, 
developed using samples only form the training data set, to correctly predict the susceptftiUty 
15 ofany virus to amprenavir is demonstrated by its performaoce on the vaUdation data set The 
appUcation ofthe final algorithm. i.e.. the roles in the last row ofthe training data set in 
Table 4 (dmoted as "final algorithm" in tiie validation data set section) reduced the total 
discordance as weU as the percentage of data in the PT-R, GT-S group significantly, firan 
15.8% to 10.3% and bom 12.8% to 4.4%, respectively 

20 6.5 F-MHi pte S: Effect ofthe N8 8S Mutation 

This example demonstrates that N88S can re-sensitize viruses contaming mutations 
ttiat are strongly associated with reduced susceptibility to anqirenavir. 

N88S is a protease inhibitor resistance mutation and is selected in ntro by atazanavir 
(Gong et al. , 2000, Antimicrob Agents Chemother 44:23 19-26). It is also seen fi)llowing 
25 treatment with indinavir (Condra et al, 1996. J. Viral. 70:8270-8276). As can be seen ftom 
Tables 1 and 2, the HIV protease mutation N88S is correlated with an increased susceptibiUty 
of a virus in which it is present to amprenavir. I50V, on the other hand, has the opposite 
effect and is correlated with reduced susceptftiUty. A sample fiom an HIV-infected patient 
was analyzed genotypically and phenotypically. The sample was found to contain LIOI, 
30 I13V. E35D, M36I, R41K, I50V, L63P. A71V, N88N/S (».e., a mixture of N and S at position 
88), and L90M mutations in HIV protease. Phenotypically, the foUowing fold changes in 
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IC50 to diflferait protease inhibitors were seen: amprenavir: 4.1; indinavir: 1.4; 
lopinavir: 3.3; nclfinavir. 5.3; ritonavir 11; and saquinavir 3.6. 

aones ftom the population were selected so as to isolate some clones with ttie N88S 
mutation and some without it Resistance test vector pools constructed finom the original 

5 plasma sanq>le (Petropoulos et aU 2000, Antimicrob Agents Chemother 44:920-8) were 
transf omied into E. ColU and plasmid DN A fiom individual colonies was screened in the 
PhenoSense^ assay for protease genotype. A total of 21 clones were isolated, 3 with only 
I50V, 1 5 with I50V and L90M, and 3 with I50V and N88S, in addition to ttie other mutations 
present in flie pool. Figure? summarizes the phOTOtypicsusceptiWty to the diflFerent 

10 protease inhibitors as mean fold changes (FQ for each group of clones. Clones with I50V 
only, or I50V plus L90M, displayed marked reductions in susceptibility to amprenavir 
(13-17 FC), lopinavir (7-9 FC), nelfinavir (7-10 FC), and ritonavir (12-20 FC). The effect of 
L90M was small m each case, excq)t for saquinavir (1.8-fold increase in FC with L90M). 
However, when N88S was present in combination with I50V, anqwrenavir susceptibility was 

15 increased by a factor of 14.5. As seen in Figure 7, the fold change for a virus containing 
I50V. but not N88S (bar 1 for APV) is about 17-foId. However, the fold change dropped to 
wUd type level (1.2 FC) when the virus contained N88S in addition to I50V (bar 2 for APV). 
Tlie levels of lopinavir and ritonavir suscq)tibility were also increased (by a 1 .7-fold and a 
L4-fold decrease in IC50, respectively) in the presence of N88S. Conversely, susceptibility to 

20 nelfinavir and atazanavir decroased in the presence of N88S, as is seen by a 2. 1-fold and a 
2.3-fold increase, respectively, in IC50. 

These results demonstrate that N88S can completely re-s^itize HIV-1 containing the 
150 mutation to amprenavir. Thus, if N88S can be maintained, future treatment options for 
patients who harbor I50V-containing vims may include amprenavir, pcihaps in combination 

25 wia ritonavir. As seen in the Figure 7, N88S also lowered the level of resistance to lopinavir 
imparted by I50V. The congruence of directionality in the effect of N88S on amprenavir and 
lopinavir is consistent witfi observations regarding cross-resistance between these two Pis. 

All references cited herein are incorpomted by refa?Mice in their entireties. 
30 The examples provided herein, bofli actual and prophetic, are merely embodiments of 

the present invration and are not mteoded to limit the invention in any way. 
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TABLE! 

Mntfltiong AjMQglated with Reafatance tn Amnrenavir 











FT-R,wt 


P value 


%mtS 


%mtR 


%R/%S 


G48M 


1 


1849 


34 


2530 


AAA1 
<U.000l 


U.U 


t 3 


24.5 


I$4S 


2 


1848 


63 


Z3Vl 




0.1 


2.5 


22.7 


I50V 


9 


1841 


187 


Zj/ / 




0.5 


7.3 


15.0 


T91ASV** 


5 


1845 


102 






OJ 


4.0 


14.7 


MTV 


20 


1830 


327 




<ff\ AAA] 


1.1 


12.8 


11.8 


VllIL** 


18 


1832 


269 






1.0 


10.5 


10.8 


V32I 


28 


.1822 


ICO 

358 


ZaUD 




1.5 


14.0 


9.2 


E34Q 


15 


1835 


157 






0.8 


6.1 


7.6 


U3F 


81 


1769 


819 


1 /U 


<nnooi 


4.4 


31.9 


73 


A71L 


4 


1846 


3o 




<fionoi 


0.2 


1.5 


6.8 


L76V 


14 


1836 


132 






0.8 


5.2 


6.8 


I54A 


9 


1841 


78 






0.5 


3.0 


6.2 


IS4L 


32 


1818 


266 


ZZ70 


^ AMI 


1.7 


10.4 


6.0 


K43T 


46 


1804 


365 




<^A nAAl 


2.5 


14.2 


5.7 


I84V 


167 


1683 


1122 


1442 


AAAl 


3F.V 


43.8 


4.9 


I54T 


11 


1839 


72 


2492 


W| AAAl 




2.8 


4.7 


V82F 


17 


1833 


102 


2462 


^ nnni 
<U<UvUl 


AO 


4.0 


4.3 


C95F 


27 


1823 


141 


2423 


AAAl 

<0.0001 




5 5 


3.8 


P79** 


19 


1831 


96 


2468 


<0.0001 


1 A 
1.0 


J./ 


36 


N83D 


11 


1839 


54 


2510 


0.0001 


0.6 


2.1 


33 


QS8E 


82 


1768 


385 


2179 


<0.0001 


4.4 


15.0 


3.4 


I84A 


2 


1848 


9 


2555 


0.1343 


0.1 


0.4 


3.2 


I84C 


1 


1849 


8 


2556 


0.0891 


0.05 


0.31 


5.8 


E65D* 


41 


1809 


19 


2545 


<0.0001 


2.2 


0.7 


0.3 


D30N» 


357 


1493 


67 


2497 


<0.0001 


19J 


2.6 


0.1 


N88S* 


91 


1759 


4 


2560 


<0.0001 


4.9 


0.2 


0.03 



5 ♦ Associated with increased susceptibility (sensitivity) to amprenavir. 

All variants treated equally. 
Number of samples = 4414. 

%R: Perceat of 8anq)les with mutation compared to all PT-R, GT-S samples. 
%S: Percent of samples with mutation compared to all PT-S, GT-S samples. 
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What is claimed is: 

1 . A mediod for detennining wheflier a human immunodeficiency vinis QSW) has an 
increased likelihood of having a reduced susceptibility to treatment with a protease 
inhibitor, comprising: detecting whether the protease encoded by said HIV exhibits 
the presence or absence of a mutation associated with reduced susceptibility to 
treatment with said protease inhibitor at amino acid position 1 1, 32, 33, 34, 43, 46, 47, 
48, 50, 54, 58, 71, 76, 79, 82, 83, 84, 91 or 95 of an amino acid sequence of said 
protease, wherein the presence of said mutation indicates that the HIV has an 
increased likelihood of having reduced susceptibility to treatment with the protease 
inhibitor, with the proviso that said mutation is not V32I, M46I, M46L, MTV, I50V, 
I54L,I54MorI84V. 

2. A method ofdetermining whether an individual infected with HIV has an increased 
likelihood of having a reduced susceptH)iUty to treatment with a protease inhibitor, 
comprising: detecting, in a sample fiom said individual, the presence or absence of a 
mutation associated with reduced susceptibiUty to treatment with said protease 
inhibitor at amino add position 11, 32, 33, 34, 43, 46, 47, 48, 50. 54, 58, 71, 76, 79, 
82, 83, 84, 91 or 95 of the amino acid sequence of the protease of the HIV, wherein 
the presence of said mutation indicates that the individual has an mcreased likelihood 
of having reduced suscq>tibility to treatment with fhe protease inhibitor, wifli the 
proviso that said mutation is not V32I. M46I, M46L, I47V. KOV, I54L, EMM or 
I84V. 

3. A method ofdetermining whether a HIV has an increased likelihood of having a 
reduced susceptibility to treatment witti a protease mbibitor. conqirising: detecting 
whether the protease encoded by said HIV exhibits the presence or absence of a 
mutation selected fiom the ffoitp consisting of: VllI, VllL, L33F, E34Q, K43T, 
G48M, I54A, I54S, I54T, Q58E. A71L, L76V, P79, V82A. V82F. N83D, I84A, I84C, 
T91A, T91S, T91V and C95F, wherein the mutation is associated with reduced 
susceptibility to treatment with said protease inhibitor and the presence of said 
mutation indicates that the HIV has an increased likelihood of having reduced 
su8cq>tibility to treatment with the protease inhibitor. 
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4. An isolated oligonucleotide between about 10 and about 40 nucleotides long encoding 
a portion of a HIV protease in a HIV that con4>iises a mutation at amino acid position 
1 1, 32. 33, 34. 43, 46. 47, 48. 50. 54. 58. 71, 76, 79, 82. 83. 84, 91 or 95 of an amino 
acid sequence of said protease m said human immunodeficiency virus, wherein the 
mutation is associated wiA reduced suscqitibility to a protease inhibitor, with the 
proviso that said mutation is not V32I, M46I. M46L, I47V, I50V. IS4L, I54M or 
I84V. 

5. An isolated oli^nucleotide between about 10 and about 40 nucleotides long encoding 
a portion of a HIV protease that conqirises a mutation at codon 1 1, 32, 33, 34, 43, 46, 
47, 48. 50, 54. 58. 71, 76, 79. 82, 83, 84, 91 or 95, wherein the mutation is associated 
with reduced susceptibility to said protease inhibitor, with the proviso that the codon 
does not encode V at position 32, 1 or L at position 46, V at position 47, V at position 
50, L or M at position 54 or V at position 84, 

6. A meOiod for determining whether a HIV has an increased likelihood of having 
reduced suscq>tibility to treatment with a protease inhibitor, comprising: 
detecting whether the protease encoded by said HIV exhibits the presence or absence 
of one or more HTV protease mutations listed in Table 1 ; and 

q»plying a set of rules to said mutations as provided in Table 4, 

wherem said HIV has an increased likelihood of being resistant to treatment with said 

protease inhibitor if said set of rules is satisfied. 

7. A method for determining whether an individual infected widi a HIV has an increased 
likelihood of having reduced susceptibiUty to treatment with a protease inhibitor, 
comprising: 

detecting, in a sample &om said individual, the presence or absence of one or moro 

HIV protease mutations listed in Table 1; and 

cvplying a set 6f rules to said mutations as provided in Table 4, 

wherein said individual has an increased likelihood of being resistant to treatment 

with said protease inhibitor if said set of rules is satisfied. 

8. The method of claim 1. 2, 3, 6 or 7, wherein said human immunodeficiency virus is 
human immunodeficiency virus type 1 (HIV-1). 
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9. The method of claim 1 , 2, 3, 6 or 7, wherein the protease has a sequence that is greater 
than 90% identical to SEQ ID NO: 1. 

10. The method of claim 1, 2, 3, 4, 5, 6 or 7, wherein said protease inhibitor is 
amprenavir. 

11. The method of claim 1, 2, 3, 6 or 7, wherein said presence or absence of said mutation 
is detected by hybridization with a sequence-specific oligonucleotide probe to a 
nucleic acid sequence of said human immunodeficiency virus encoding said mutation, 
wherein the occurrence of hybridization indicates said presence or absence of said 
mutation. 

12. The method of claim 1, 2, 3, 6 or 7, wherein said presence or absowe of said mutation 
is detected by determining a nucleic acid sequence encodmg said mutation. 

13. The method of claim 2 or 7, wherein the individual is undergoing or has undergone 
prior treatment with an anti-viral drug. 

14. The method of claim 1 or 2, wherem the amino acid at position 1 1 of said protease is 
an amino acid having a neutral, hydrophobic or non-polar side chain. 

ll The method of claim 1 or 2, wherein the ammo acid at position 1 1 of said protease is I 
orL. 

16. Hie method of claim 1 or 2, wherein the amino acid at position 33 of said protease is 
an amino acid with a neutral, hydrophobic or non-polar side chain. 

17. The mefliod of claim 1 or 2, wherein the amino acid at position 33 of said protease 
isF. 

18. The method of claim 1 or 2, wherein the amino acid at position 34 of said protease is 
an amino acid having a neutral, polar or hydrophilic side chain. 

19. The method of claim 1 or 2. wherein the amino acid at position 34 of said protease 
isQ. 
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20. The method of claim 1 or 2, wherein the amino acid at position 43 of said protease is 
an amino acid with a neutral, hydrophobic or non-polar side chain. 

21. The method of claim 1 or 2, wh«:ein the amino acid at position 43 of said protease 
isT. 

22. The method of claim 1 or 2, wherein the amino acid at position 48 of said protease is 
an amino acid with a neutral, hydrophobic or non-polar side chain. 

23. The method of claim 1 or 2, wherein the amino add at position 48 of said protease 
is M. 

24. The method of claim 1 or 2, Mdierein the amino acid at position 54 of said protease is 
an amino acid with a neutral, hydrophobic, non-polar, hydrophilic or polar side chain. 

25. The method of claim 1 or 2, wherein the amino acid at position 54 of said protease is 
an amino acid with a neutral, hydrophobic or non-polar side diain. 

26. The method of claim 1 or 2, wherein the amino add at position 54 of said protease 
is A. 

27. The mefliod of claim 1 or 2, wherein the amino add at position 54 of said protease is 
an amino add with a neutral, hydrophilic or polar side chain. 

28. The method of claim 1 or 2, wherein the amino add at position 54 of said protease 
isSorT. 

29. Hie mettiod of claim 1 or 2, wherein the amino acid at position 58 of said protease is 
an amino acid with an acidic, hydrophilic or polar side chain. 

30. The method of claim 1 or 2, wherein the amino add at position 58 of said protease 
isE. 

31. The method of claim 1 or 2, wherein the amino acid at position 71 of said protease is 
an amino acid with a neutral, hydrophobic or non-polar side chaiiL 
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32. The method of claim 1 or 2, whereiii the amino add at position 71 of said protease 
isL. 

33. The method of claim 1 or 2, wherein the amino add at position 76 of said protease is 
an amino acid with a neutral, hydrophobic or non-polar side chain. 

34. The method of claim 1 or 2, viiaein Hhe amino add at position 76 of sdd protease 
isV. 

35. The method of claim 1 or 2, wherein the amino add at position 79 of said protease is 
an amino acid with a neutral, hydrophobic, non-polar, acidic, hydrophiKc w polar 
side chain. 

36. The method ofclaim lor 2, wherein the amino add at position 79 ofsaid protease is 
an amino acid with a neutral, hydrophobic or nrai-polar side diain. 

37. The method ofclaim 1 or 2, wherein the amino add at position 79 ofsaid protease is 
an amino acid with an acidic, neutral, hydrophilic or polar side chain. 

38. The method ofclaim 1 or 2, wherem the ammo add at position 79 ofsaid protease is 
any amino acid, with the proviso that it is not P. 

39. The method ofclaim I or 2, wherein the amino add at position 82 ofsaid protease is 
an amino acid with a neutral, hydrophobic or non-polar side chain. 

40. The method ofclaim 1 or 2, wherein the amino add at position 82 ofsaid protease 
is A or F. 

41 . The method ofclaim 1 or 2, wherein the amino add at position 83 ofsaid protease is 
an ammo acid widi an acidic, hydn^AiKc or polar side chain. 

42. The method ofclaim 1 or 2. wherdn the amino add at position 83 ofsaid protease 
isD. 

43. The method ofclaim 1 or 2. wherdn the ammo acid at position 54 ofsaid protease is 
an amino acid with a neutral, hydrophobic, non-polar, hydrophihc or pohff side chain. 
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44. The method of claim 1 or 2, wherein the amino acid at position 84 of said protease is 
an amino acid with a neutral, hydrophobic or non-polar side chain, 

45. The method of claim 1 or 2, wherein the amino add at position S4 of said protease 
is A. 

46. The method of claim 1 or 2, iT^erdn the amino acid at position 84 of said protease is 
an amino acid with a neutral, hydmphilic or polar side chaiiL 

47. The mediod of claim 1 or 2, ^vtiierein the amino acid at position 84 of said protease 
isC 

48. The method of claim 1 or 2, wherein the amino acid at position 91 of said protease is 
an amino acid with a neutral, hydrophobic, non-polar, hydrophilic or polar side chain. 

49. The method of claim I or 2, wherein the amino acid at position 91 of said protease is 
an amino acid with a neutral, hydrophobic or non-polar side chaiiL 

50. The method of claim 1 or 2, wherein the amino acid at position 91 of isaid protease 
is A or V. 

51 . The method of claim 1 or 2, wherein the amino add at position 91of said protease is 
an airiino add with a neutral, hydrophilic or polar side chain. 

52. The method of claim 1 or 2, wherein the amino add at position 91 of said protease 
isS. 

53. The method of claim 1 or 2, wherein the amino acid at position 95 of said protease is 
an amino acid with a neutral» hydrophobic or non-polar side chain. 

54 The method of claim 1 or 2, wherein the amino add at position 95 of said protease 
isF. 

55. The method of claim 1, 2, 3, 6 or 7, wherein the method comprises detecting the 

presence or absence of a mutation associated with reduced susceptibility to treatment 
with said protease inhibitor at at least 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 
17, 1 8 or 1 9 of the amino acid positions. 



-71- 



wo 2004/003514 



PCT/US2003/021335 




1/7 

SUBSTITUTE SHEET (RULE 26) 



wo 2004/003514 



PCT/US2003/021335 




111 

SUBSTITUTE SHEET (RULE 26) 



wo 2004/003514 



PCT/US2003/02133S 



SEQ. ID. NO: 1: NL4^ HiV Protease Amino Add Sequence 

PQIUWQWLVnKIGGQLKEAIiDTGADDmEEMNLreiMP^^^^ 
GGFIKVRQYDQIUEICGHKAIGMVGPTPVNIIGRNLLTQIGCTINF 

Fig. 3 A 



SEQ. ID. NO: 2: NL« HIV Protease Nudeoticle Sequence 

I- 10 cctcagatcactctttggcagcgacccdc 

II- 20 gtcacaataaagataggggggcaattaaag 
21-30 gaagctctattagatacaggagcagatgat 
31-40 acagtattagaagaaatgaatttgocagga 
41-50 agatggaaaocaaaaatgatagggggaatl 
51-60 ggaggttttatcaaagtaagacagtatgat 
61-70 cagatadcatagaaatctgcggacataaa 
71-80 gctalaggtacagtattagtaggacctaca 
81-90 cctgtcaacataatlggaagaaatctgttg 
91-99 actcagattggctgcactttaaatttt 

Fig. 3B 
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SEQUENCE LISTING 

<110> ViroLogic, Inc 

<120> COMPOSITIONS AND METHODS FOR DETERMINING THE SUSCEPTIBILITY OF A 
PATHOGENIC VIRUS TO PROTEASE INHIBITORS 

<130> 11068-065-228 

<140> 
<141> 

<150> 60/393,248 

<151> 2002-07-01 

<150> 60/414,273 
<151> 2002-09-27 

<160> 2 

<170> Patentin version 3.2 

<210> 1 
<211> 99 
<212> PRT 

<213> Human immunodeficiency virus 
<400> 1 

Pro Gin He Thr Leu Trp Gin Arg Pro Leu Val Thr He Lys He Gly 
15 10 15 

Gly Gin Leu Lys Glu Ala Leu Leu Asp Thr Gly Ala Asp Asp Thr Val 
20 25 30 

Leu Glu Glu Met Asn Leu Pro Gly Arg Trp Lys Pro Lys Met He Gly 
35 40 45 

Gly He Gly Gly Phe He Lys Val Arg Gin Tyr Asp Gin He Leu He 
50 55 60 

Glu He Cys Gly His Lys Ala He Gly Thr Val Leu Val Gly Pro Thr 
65 70 75 80 

Pro Val Asn He He Gly Arg Asn Leu Leu Thr Gin He Gly Cys Thr 
85 90 95 

Leu Asn Phe 



<210> 2 

<211> 297 

<212> DNA 

<213> Human immunodeficiency virus 
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ciSoagatca ctctttggca gcgacccctc gtcacaataa agataggggg gcaattaaag 
gaagctctat tagatacagg agcagatgat acagtattag aagaaatgaa tttgccagga 
agatggaaac caaaaatgat agggggaatt ggaggtttta tcaaagtaag acagtatgat 
cagatactca tagaaatctg cggacataaa gctataggta cagtattagt aggacctaca 
cctgtcaaca taattggaag aaatctgttg actcagattg gctgcacttt aaatttt 
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180 
240 
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